Precautionary measures


Major African campaigns targeting malaria and HIV could help millions, but key concerns over
their long-term effects should not be forgotten.


More than one million healthy children in Africa participated
in a bold initiative this year when they took antimalarial
drugs during the rainy season. The hope is that a few months
of preemptive doses could help them to fight off a disease that kills 
some 600,000 people throughout Africa each year. At the same time, 
health campaigns in 14 nations in eastern and southern Africa are 
circumcising millions of men in an attempt to stem the spread of HIV, 
which infects more than one-quarter of people aged 15-49 in some of 
those countries. The malaria and HIV efforts have the potential to gain 
the upper hand on two stubborn health scourges, but only if funders 
and organizers heed the lessons of past failures. 

Nature this week publishes two reports from the ground. They are 
written by reporters who travelled to Africa to examine the benefits of 
these health campaigns — as well as concerns that have emerged about 
them. In the malaria effort (see page 186), one of the biggest worries 
has been that giving medications to children will promote the spread 
of resistant forms of the malaria parasite, quickly rendering the drugs 
ineffective. That happened during the 1950s and 1960s, when doctors 
performed prevention experiments for malaria in Africa and South 
America. The current campaign is based on trials that started in 2002 
and took steps to avoid spreading resistance to front-line treatments 
by providing a cocktail of older antimalarial drugs, and only during 
the rainy season. 

The success seen in the clinical trials, however, is not guaranteed 
as the programme is scaled up to cover possibly more than 20 million 
children in parts of Africa. Six nations started giving antimalarials this 
year, but treated just a fraction of their intended recipients because 
of funding and organization problems. This raises concerns that the 
programmes will not carry out the ancillary monitoring efforts needed 
to ensure success. Funding must be provided to track whether the 
large-scale prevention campaign reduces the number of malaria cases 
as hoped, and to ensure that resistant forms of the parasite do not 
spread more quickly than anticipated. The problem is that funders are 
typically less interested in supporting follow-up studies than in testing 
ideas and carrying out interventions. And monitoring has long been 
a weak point for malaria: global surveillance catches just 10% of the 
estimated global malaria cases each year. 

In the circumcision campaign (see page 182), several trials showed 
that the procedure reduces the risk of HIV transmission from women 
to men by a substantial 50–60%. That level of protection is so large that
international aid organizations, the United Nations and donor countries
such as the United States have poured more than US$100 million
into campaigns that seek to circumcise 20 million men in 14 target 
countries by 2015. But some researchers worry that the massive 
programmes will not yield the same benefits as the small, intensive 
trials. A prime concern is the message that men and women are getting
about the effectiveness of the procedure. Campaign organizers
have mounted major advertising efforts to encourage more acceptance
of circumcision. In Zambia, billboards proclaim that a circumcised
individual is ‘a man who cares’. In Tanzania, he is ‘60% more man’.
The advertisements have succeeded in getting millions of men 
through the clinic doors, but the messages have also generated confu- 
sion. Many women in several of the target countries assume incorrectly 
that circumcision helps to protect them against acquiring HIV, and so 
they think that condoms are less necessary than they once were. Several 
studies have shown that men overestimate the amount of protection they 
gain through circumcision. And in Tanzania, the phrase ‘60% more man’
is taken to mean that a circumcised man has more sexual partners,
which is not the kind of message that will cut down on HIV transmission.
During the preliminary trials, clinics took steps to avert confusion
by providing substantial counselling about the risks and benefits of
circumcision before and after the procedure and in periodic follow-up 
visits. Men received testing and treatment for HIV and for several other 
sexually transmitted infections. But in the scaled-up campaign, circum- 
cision providers typically give only one counselling session related to the 
procedure, and another with an HIV test. Behavioural researchers say 
that men require more counselling and that campaign organizers should 
also target information towards women to clear up misconceptions. 
Just as with the preventive malaria drugs, there must be sufficient 
monitoring to track whether circumcision is as effective at reducing 
HIV transmission as it was during the smaller trials. Early signs are
positive, and that is good news for millions of African men and women.


Data deadline 


Time is running out to comment on the NIH’s 
plan for sharing genomic data. 


A little-noticed proposal promises to have a huge impact on
how science is done in the ‘big data’ era. In September, the US
National Institutes of Health (NIH) released draft guidelines
on the sharing of genomic data. The guidelines, which have been in the
works for five years, are a necessary and valuable update to the agency’s
stance on how researchers who receive its funds must share data
produced by projects that use array-based and high-throughput
technologies. They cover a huge swathe of research, including sequencing
human and non-human genomes, genes and gene variants, as well as
transcriptomic, epigenomic and gene-expression data.
The issues related to collecting and sharing such data are complex,
and the guidelines touch on most of the controversial topics in
large-scale biology research today.

14 NOVEMBER 2013 | VOL 503 | NATURE | 165

© 2013 Macmillan Publishers Limited. All rights reserved

THIS WEEK | EDITORIALS

One of the key issues is when to share. The draft policy says that 
researchers must have shared their data by the time those data are 
published in a formal manuscript. However, there are earlier release 
deadlines for some data types, such as raw sequence data from non- 
human organisms and the initial analysis of some human sequence 
data, both of which must be shared within six months of submission 
to an approved repository. This is a thorny issue, and the NIH cannot 
please everybody. Some researchers favour the early release of more 
data, whereas others fear that releasing data ahead of publication will 
leave them vulnerable to being scooped. 

Another major issue is how to protect the identity of those whose 
data are shared — especially as it is now clear that it is possible to 
identify people from anonymous data (see Nature http://doi.org/px4;
2013). The guidelines say that researchers should tell study
participants that their data “may be shared broadly for future research
purposes”, and let them know whether it will be shared through an
open- or controlled-access mechanism. It asks researchers to gain 
explicit consent from patients who agree to share their data through 
open-access mechanisms. And, importantly, it sets a new bar for 
informed consent on de-identified materials, including cell lines and 
clinical specimens. Such research has historically been exempt from 
informed-consent requirements, but the guidelines ask researchers to 
obtain consent for future research on these materials, too. 

This is a potentially major step, and one that this publication sup- 
ports (see Nature 486, 293; 2012). It is true that some researchers 
who have relied on clinical specimens will see it as an impediment to 
valuable research. But similarly, some advocates of more transparent 
informed-consent rules will not like the fact that the guidelines allow 
researchers to opt out of this requirement if they give “compelling 
scientific reasons” for so doing. 


A third aspect relates to how long the data should be shared for. 
Researchers who rely on controlled-access data sets often complain 
about periodically having to renew their requests for access. The 
guidelines maintain this standard, offering access to such data for one 
year at a time. This is unlikely to please those who have argued that 
legitimate scientists should be able to access larger tranches of data 
and for longer periods of time, although the NIH has responded that
scientists who take this position are sometimes not aware of the
restrictions on all of the data sets that they plan to use (see Nature
497, 172–174; 2013).

Once finalized, the regulations will become part of a patchwork of
international research regulation on the sharing of genomic data.
The United Kingdom, for instance, is still deciding how much infor- 
mation from its 100K Genome Project will be released and whether 
researchers will be able to access both sequencing results and the rel- 
evant personal health records. At the same time, US open-genomics 
evangelist George Church is expanding his Personal Genome Project 
to Canada and Europe, raising questions such as whether the project 
will be able to access records from centralized health systems (see 
go.nature.com/izmgpo). By contrast, informed-consent regulations 
in other parts of the world are still being developed, leaving a question 
mark over whether the United States will become an easier place for 
genomics researchers to work than other parts of the world. 

In that context, the US proposals will have a major impact on the 
work of Nature’s readers. Yet, according to the NIH’s Office of Sci- 
ence Policy, as of 7 November, just 18 comments had been received 
on the guidelines. That is a poor response to such an important issue. 
The policy will affect many more scientists and Nature urges them 
to submit their responses to the proposals before the deadline of 
20 November.


Keep asking 


Prejudice, not evidence, is too often the basis 
for government drug policies. 


Rob Ford, mayor of Toronto, Canada, caused a sensation last
week when he told journalists, “Yes, I have smoked crack
cocaine”, and refused to resign. The reporters smelled blood:
Ford had long denied drug use, despite repeated rumours. “I wasn’t
lying,” he said when confronted. “You didn’t ask the correct questions.”

The debate over the control and regulation of drugs is typified,
perhaps more than any other in science policy, by a need to ask the
correct questions. Politicians and the moralizing media tend to seek
the black-or-white certainty of whether or not a drug poses a threat.
Researchers often prefer to present risks as relative, and some argue
that it is hypocritical to proscribe one compound while promoting,
however tacitly, the consumption of another that may have similar,
or more potent, effects.

From time to time the answer changes because of other factors, and
this is where the question becomes less important than who is asking
it. The television drama Breaking Bad has brought the abuse of
methamphetamine to popular attention, but the government of Japan, for
example, has long been well aware of the drug’s effects. In fact, during
the Second World War it encouraged the manufacture of the compound
and distributed it to the country’s soldiers and civilian workers
in a handy tablet form called hiropon. “For night work and other
times demanding mental alertness. For overexertion,” a typical wartime
advert read. “The most powerful new amphetamine on the market!”

At the end of the war, Japanese manufacturers sold the stimulant as
a cure for all manner of civilian ills, targeting, among others, juveniles
disturbed by the country’s dramatic post-war social change. Yet within 
a few years, a government U-turn introduced tough laws making the 
drug illegal, with harsh penalties for possession. Official propaganda 
now called on citizens to help the authorities to “wipe out the evil 
of stimulant drugs!”. Widespread abuse and signs of addiction gave 
the authorities a legitimate reason to act. But, as historian Jeffrey 
Alexander of the University of Wisconsin–Parkside pointed out in a
paper this year (J. W. Alexander Int. J. Drug Policy 24, 238–243; 2013),
there was another, more sinister motive: the deliberate cultivation of 
a media-fuelled drug panic to justify the arrest and deportation of 
Korean and Taiwanese immigrants, who were disproportionately 
blamed for making and selling methamphetamine. 

Similar social pressures played a part in the crafting of US legislation 
against marijuana, which was first popular with Mexican labourers and 
black musicians. Prejudice is one of a number of issues contributing to 
policies on drugs that are explored by two books reviewed by Andrew 
Robinson on page 194. The books explain, he says, that “a drug’s
acceptability to mainstream society fluctuates more owing to social and
cultural trends than to medical knowledge”. For example, the United States
infamously banned alcohol during the prohibition era of the 1920s and 
1930s, a move that would have been unlikely in the United Kingdom 
because of the “complex British attitude to drunkenness”. 

This attitude was highlighted in 2009 by the UK government's 
then-drug adviser David Nutt, who argued publicly that alcohol and 
tobacco were more harmful than LSD, ecstasy and cannabis. His 
opinion earned him the sack. Last week, it also earned him the 2013 
John Maddox Prize for Standing Up for Science, 
which recognizes the promotion of science in 
the public interest, and was set up with the help 
of this publication. Someone has to keep asking 
the correct questions.
Review by quality not quantity for better policy

Global assessments need to adopt more rigorous and focused processes
for collation and review, says William J. Sutherland.

Society increasingly demands that policies are based on scientific
evidence, as it should. Yet policy-makers rarely refer to journal
papers (the actual evidence), instead preferring to base
decisions on summaries, reviews and assessments.

Such assessments can be deeply influential but fallible: the
Intergovernmental Panel on Climate Change (IPCC) has effectively
underpinned climate-change policies worldwide, yet the infamous
Himalayan glacier blunder in the 2007 report is still regularly
unearthed to taunt the IPCC’s authority.

Less likely to make international headlines, but still instructive, are
problems my colleague Lynn Dicks and I discovered when we reviewed
the pollinator section of the United Kingdom’s National Ecosystem
Assessment, a similarly sprawling and influential report that fed into the
British government's white paper on the environment.
It states, for example, that “since 1980, wild
bee diversity has declined in most landscapes”.
This exaggerates the findings of the cited paper,
which reports only that bee diversity is lower
post-1980 than pre-1980 in 52% of 81 10 × 10 kilometre
grid squares. The report asserts that “decreases in
pollination services would, therefore, result in
short-term economic losses for farmers”, but
it fails to mention that this is based on indirect
field studies of insect visitors to crops. In fact, no
economic effects of pollinator decline have been
detected on a national or sub-national scale. A
crucial table in the assessment report that
estimates the value of insect-pollinated crops in the
United Kingdom comes from a PhD thesis, and the analysis behind it
has not yet been published in the peer-reviewed scientific literature; this
is not made clear. The same quality-control issues appear elsewhere in
the report and in the Millennium Ecosystem Assessment.

The problem of quality control will continue unless we change
the way in which such assessments are carried out. A good place to
start is with the new Intergovernmental Platform on Biodiversity and
Ecosystem Services (IPBES), which will meet early next month in
Turkey to establish its assessment programme.

In broad terms, such assessments must be less ambitious. The
headline conclusions of these reports are subjected to extensive scrutiny,
yet they are linked to a vast expanse of text that is impossible to check
thoroughly. The greater the intended scope of the assessment, and the
more material that needs to be included for the sake of completeness, the
higher the risk of error becomes. The pleading from climate scientists
that the glacier mistake was just one paragraph among 938 pages in one
of four reports, and was not even noticed for two
years, has not quietened critics of the IPCC.
Although there might be a desire among some
involved in the forthcoming IPBES assessment
to write freely about many aspects of pollination,
they should instead concentrate on collating and synthesizing the
global evidence on just the key issues relating to the main questions. 
Which pollinators are most economically important? What have been 
the changes to their status? What are the main threats? And what are 
the most economically effective means of maintaining and restor- 
ing these pollinators? To support the answers to these questions, the 
assessment must first produce a systematic and publicly available 
review of the literature it will be based on, ideally with accessible sum- 
maries of papers. 

For such reviews, the vision is often the meta-analyses of evidence- 
based medicine, which pool and offer an overview of all studies of 
appropriate quality. This approach is less suited to environmental 
reviews, which typically use a complex mix of experiments, correla- 
tive studies and models with a range of questions, 
response variables and locations that have differ- 
ing relevance to the issue of interest. 

To better synthesize the findings of such 
diverse research, environmental assessments 
should use formal consensus methods such 
as the Delphi technique, which is an iterative 
process of consensus-seeking that is based on 
sequential rounds of confidential scoring and 
commenting on anonymized results. In addi- 
tion, all of the material should be publicly avail- 
able — possibly even including the individual 
final scores of the experts, as happens for the 
members of the UK Monetary Policy Com- 
mittee who meet monthly to decide the official 
interest rate. It would thus be relatively straightforward to trace a con- 
clusion to its assessment of the evidence and then to its evidence base. 

This approach seems to have a range of advantages. It concentrates 
on the few crucial issues but presents them in a more transparent 
and rigorous manner that is likely to provide greater confidence and 
reduce the likelihood of errors. After the evidence is collated, it can 
be updated regularly to allow for quick reassessment (conversely, 
the IPCC assessment is repeated about every six years and is hugely 
expensive). With the bedrock of the evidence assembled and presented 
in a user-friendly way, the evidence on key issues can be continually 
collated and regularly assessed. An equivalent example in medicine 
is Clinical Evidence, which reassesses interventions every six months. 

This approach should be adopted not just by the IPBES and other 
future national or international assessments, but, where possible, by 
the innumerable reports that are written for decision makers. This 
should increase quality, transparency and authority.


William J. Sutherland is the Miriam Rothschild Professor of 
Conservation Biology in the Department of Zoology, University of 
Cambridge, UK. 

e-mail: w.sutherland@zoo.cam.ac.uk 
Selections from the
scientific literature 


RESEARCH HIGHLIGHTS 


CLIMATE SCIENCES 


Economic link to 
global warming 


Changes in the rate of global 
warming can be pinned to 
specific episodes in human 
history. 

In a statistical analysis, a 
team led by Francisco Estrada 
of the National Autonomous 
University of Mexico in 
Mexico City looked at global 
mean temperatures since 1880, 
as well as greenhouse-gas 
levels and other factors that 
warm or cool the planet. The 
authors found that economic 
slowdowns during the two 
world wars and in the Great 
Depression temporarily 
slowed the planet's warming. 

The analysis also suggests 
that the Montreal Protocol, 
which phased out chemicals 
that deplete the ozone layer and 
trap heat, has helped to slow 
warming in recent decades. 
Nature Geosci. http://doi.org/p2b (2013)

For a longer story on this research, 
see go.nature.com/g93gb6 


Burials indicate 
Viking sacrifices 


Vikings may have sacrificed 
slaves to be buried with their 
masters. 

In the 1980s, archaeologists 
working on the Norwegian 
island of Flakstad discovered 
and excavated ten individuals 
from six burials dating to 
between AD 550 and AD 1030.
Remains from three multiple
burials all contained at least
one skeleton lacking its skull.

To establish the relationship
between them, a team led
by Elise Naumann of the
University of Oslo analysed
mitochondrial DNA (which
is inherited from the mother)
as well as nitrogen and carbon
isotopes from various parts
of the skeletons (including a
tooth; pictured).

The results suggested that
individuals in the multiple
burials did not share maternal
ancestry, and that individuals
found without skulls generally
ate less meat than intact
individuals in the same graves.
The differences in ancestry
and diet imply that the
beheaded individuals might
have been slaves, as has been
suggested for other double
burials from the Norse world.
J. Archaeol. Sci. 41, 533–540
(2013)


Traffic jams follow the laws of physics


Traffic congestion closely resembles the physics
of phase transitions, such as when ice melts or a
metal becomes superconducting.

Shin-ichi Tadaki at Saga University in Japan
and his colleagues used a high-resolution laser
scanner to track cars travelling around an empty
indoor baseball stadium, then analysed those
data as if they were studying phase transitions
in a material. They found that above a critical
density of cars, traffic flow became unstable and
changed from free-flowing to a jam.

Scaled up, that density value fits with those
seen on real-world motorways, the authors say.
New J. Phys. 15, 103034 (2013)


Breast-cancer
mutations found


Mutations in the gene
encoding the oestrogen
receptor (ER) have been
identified in aggressive breast
tumours.

About 70% of breast cancers
express the ER, which helps
to propel rampant growth.
Several drugs target the effects
of the sex hormone on its
receptor, but tumours often
develop resistance to such
treatment.

Independent teams led by
Sarat Chandarlapaty at the
Memorial Sloan-Kettering
Cancer Center in New York,
Arul Chinnaiyan at the
University of Michigan in Ann
Arbor and Ido Wolf at the Tel
Aviv Sourasky Medical Center
in Israel, sequenced tumours
from a total of 104 patients
with metastatic breast cancer
and found ER mutations in 25.
Further work indicated that
these mutations could keep
the receptor active even in the
absence of oestrogen.

Drugs that inhibit the
receptor in other ways might
stall aggressive tumours, the
researchers say.

Nature Genet. http://doi.org/pxp; http://doi.org/pxq (2013)
Cancer Res. http://dx.doi.org/10.1158/0008-5472.can-13-1197 (2013)


Keeping quantum 
secrets 


Quantum cryptographers have 
used devices in Switzerland 
and Singapore to deliver a 
secret message in the form of
a committed bit, the value
of which is kept hidden until a
designated time.

Bit commitment is 
potentially useful for voting 
and other applications in 
which privacy is key, but ultra- 
secure quantum commitment 
schemes have proved difficult 
to implement. 

Félix Bussières of the
University of Geneva in 
Switzerland and his colleagues 
experimentally demonstrated 
a protocol proposed in 2012 
for keeping committed bits 
secret. The technique, which 
relies on the constancy of the 
speed of light, can prevent 
insecure communication 
between remote locations 
for periods of up to 
15 milliseconds, perhaps long 
enough to enable high-speed 
stock trading. 

Phys. Rev. Lett. 111, 180504 
(2013) 


Embryo protein 
has healing power 


A protein that is present 
mainly in developing embryos 
gives some adult tissues an 
exceptional ability to heal. 

The protein Lin28a 
controls growth and 
development in animals from 
worms to humans, but it is 
rarely found in adult tissues 
in vertebrates. 

When researchers led 
by George Daley at Boston 
Children’s Hospital in 
Massachusetts engineered 
mice so that the protein was 
produced into adulthood, 
the animals’ hair grew faster 
and puncture wounds in their 
ears healed more thoroughly. 
Newborn animals were more 
likely to regrow the tips of 
severed digits. 

Lin28a represses an anti- 
growth regulatory program 
and boosts the production of 
many metabolic enzymes. In 
doing so, the researchers say, 
the protein induces a state 
that enhances tissue repair. 
Cell 155, 778-792 (2013) 

For a longer story on this research, 
see go.nature.com/8cejq3 


INFLAMMATORY DISEASE 


Sun synchronizes 
immune system 


Daily light cycles help to set 
animals’ response to infection. 

Lora Hooper of the 
University of Texas 
Southwestern Medical Center 
in Dallas and her colleagues 
showed that a protein that 
controls whether certain white 
blood cells mature is in turn 
controlled by proteins that set 
daily rhythms. When mice 
were exposed to abnormal 
light cycles, rhythm-setting 
proteins sent signals to the 
immune system, prompting it 
to make more inflammatory 
cells in the gut. 

Inflammation protected the 
mice against bacterial infection, 
but it also made them more 
susceptible to inflammatory 
bowel disease and colitis. The 
findings might explain why 
shift workers and others with 
disrupted sleep cycles are more 
prone to immune disorders. 
Science 342, 727-730 (2013) 


Injected gas makes 
Earth rumble 


Earthquakes that shook an 
old Texas oil field over the 
past few years could be due to 
injections of carbon dioxide 
gas. Previous tremors in the 
area were tied to wastewater 
injections, but these are 
the first events attributed 
to forcing pressurized gas 
underground. 

Two years after significant
gas injections began in the
oil field to boost production
(pictured), the area
experienced a spate of small
earthquakes, 18 of which
reached magnitudes between
3 and 4.4. Wei Gan and Cliff
Frohlich at the University
of Texas at Austin have now
determined the precise
location and timing of the
quakes.

The locations correlate with
gas injection sites and suggest
that slippage occurred along
a previously unidentified
fault. Nearby fields where
gas was also injected did not
show such seismic activity,
however, and the researchers
say that more data are needed
to predict where gas injection
could trigger tremors.

Proc. Natl Acad. Sci. USA
http://doi.org/pxs (2013)

For a longer story on this research,
see go.nature.com/x4hhn4


New flu virus found in bats


HIGHLY READ in the past 30 days

The flat-faced fruit bat (Artibeus planirostris;
pictured) in Peru carries a flu virus that
was previously unknown to science.
H18N11 is the second influenza A
virus to be identified in bats, and both were found by a team
led by Ruben Donis of the Centers for Disease Control
and Prevention in Atlanta, Georgia. The team found huge
genetic diversity in bat influenza A viruses: four of eight bat
viral gene segments harboured more genetic variation than
did corresponding viral segments in other mammals and
birds combined. Blood analyses also revealed that
H18N11 infects several bat species.

Neither of the bat viruses has
been found in humans, but the
discoveries suggest that bats are a
reservoir for diverse flu viruses
that could fuel new infections.

PLoS Pathog. 9, e1003657
(2013)


MEDICAL MICROBIOLOGY


Gut microbes
linked to arthritis


Unfriendly bacteria in the gut
might bring on inflammation
in the joints.

Dan Littman at the New
York University School of
Medicine and his colleagues
collected stool samples
from people with arthritis.
Sequencing experiments
on gut bacteria showed
that individuals with newly
diagnosed arthritis were
more likely to harbour the
bacterium Prevotella copri
than were individuals without
the condition.

High levels of P. copri and
related species correlated
with low levels of beneficial
microbes that are thought to
suppress the immune system
and metabolize vitamins
into forms that are absorbed
by the bloodstream. When
inoculated with P. copri,
mice became more sensitive
to colitis, and the levels of
beneficial microbes in their
guts dropped.

Gut bacteria have long
been implicated in arthritis,
but this is the first time that
researchers have correlated the
onset of human disease with a
particular microbial species.
eLife 2, e01202 (2013)
SEVEN DAYS


Shutdown scars 


Last month’s shutdown of
the US government, during
which many ‘nonessential’
federal workers were sent
home, resulted in 6.6 million
lost workdays and cost
US$2 billion in staff pay,
the White House Office of
Management and Budget said
on 7 November. The 16-day
shutdown, which began on
1 October, also curtailed the
US Antarctic programme field
season, cut back influenza
monitoring and resulted in
the loss of observing time at
major telescope facilities.
See go.nature.com/u5kntz
for more.


Fat ban 

On 7 November, the US Food
and Drug Administration
(FDA) proposed eliminating
most artificial trans-fats
(forms of fat associated
with increased heart
risk) from processed
foods. The agency issued a
preliminary determination
that trans-fats in the form
of partially hydrogenated
oils should no longer be
“generally recognized as
safe”, a status that allows
a food additive to be used
without FDA approval. A final
determination will be made
after a 60-day public
comment period.


Label loss 

Voters in Washington state on 
5 November rejected a ballot 
initiative to require labelling 
of genetically modified (GM) 
foods. The initiative was one 
of a string of efforts in more
than 20 states this year to label 
or limit GM foods; California 
voters defeated a similar 
proposition last year. No state 
currently requires labelling of 
all GM foods, although Alaska 
has mandated labels for GM 
fish if they go on sale. 


Record storm ravages Philippines 


One of the most powerful tropical cyclones on
record, Super Typhoon Haiyan, devastated the
central Philippines when it made landfall on
8 November. By some estimates, wind speeds
topped 310 kilometres per hour as the storm
reached the coast, placing Haiyan at or near
the top of the list of strongest known storms.
The storm surge destroyed much of the coastal
city of Tacloban. Buildings collapsed, and
flooding ravaged a region still recovering from a
magnitude-7.1 earthquake last month. Officials
estimate that at least 10,000 people have died,
giving Haiyan a second grim title: the deadliest
storm in Philippine history. In 2012, Super
Typhoon Bopha killed nearly 2,000 people in the
southern part of the country.


Ivory crush

The US Fish and Wildlife
Service will destroy its stockpile
of contraband elephant ivory
on 14 November, officials
announced last week. Over
the past 25 years, US law
enforcement has seized about
6 tonnes of illegal ivory. The
tusks will be crushed to send
a message to ivory traffickers
worldwide, the agency says.
In Tanzania, police recently
seized nearly 800 elephant
tusks in anti-poaching raids,
according to media reports
last week.


Animal research

A set of draft guidelines
committing to less secrecy
on animal experiments,
and signed by more than
50 organizations involved in
UK life-sciences research, was
released for public comment
last week (see go.nature.com/3rqcn4).
Major research
funders and institutions,
coordinated by the London-based
group Understanding
Animal Research, say that
they want to be more open
about how animals are used in
research. But anti-vivisection
campaigners said that those
backing the agreement did
not want true openness about
“painful and unscientific”
animal experiments.


Deep-sea trawling

European politicians voted
to ban deep-sea bottom
trawling in the Atlantic
Ocean last week, but only in
areas where ecosystems are
deemed especially vulnerable.
The move disappointed
campaigners who have
pushed for a total ban on this
method of fishing, which they
say is highly damaging to
certain species. The package
of regulations approved by
the European Parliament's
fisheries committee will
be considered by the full
parliament in December.


GM crop stir 

A third genetically modified
(GM) crop may soon be
approved for cultivation in
the European Union. The
European Commission
on 6 November asked
member-state ministers to
decide whether to authorize
GM maize (corn) made by
DuPont Pioneer in Johnston,
Iowa, which first requested
approval in 2001. A minority
of ministers has consistently
blocked approvals in the past,
so the commission had given
up referring cases, but its hand
was forced by a court ruling in
September after DuPont sued.
Only a majority of votes in
favour of rejection will prevent
the maize from being approved.


| PEOPLE 
Poisoning puzzles 


Scientists at the University
Centre of Legal Medicine
in Lausanne, Switzerland,
announced on 6 November
that they had found traces
of the radioactive metal
polonium-210 in the exhumed
body of Yasser Arafat, former
president of the Palestinian
National Authority. But
because Arafat died nine years
ago it is hard to distinguish
levels of synthetic polonium
from background radiation,
so the results do not clearly
establish that Arafat was
poisoned, nuclear physicists
told Nature. Separately, a
forensic analysis of exhumed
Chilean poet Pablo Neruda,
released on 8 November,
found no evidence that he
was poisoned. See
go.nature.com/mpdrjd and
go.nature.com/5jqggr for more.


Energy nomination
US President Barack Obama
has nominated physicist
Ellen Williams (pictured) to
head the Advanced Research
Projects Agency-Energy, a
branch of the Department
of Energy that funds high-
risk, high-pay-off research.
Currently on leave from the
University of Maryland in
College Park, Williams has
been chief scientist for oil-and-
gas giant BP since 2010. From
1996 to 2009, she directed
the University of Maryland
Materials Research Science
and Engineering Center.


TREND WATCH

Science projects seeking
crowdfunding in the past two
years have included a glow-
in-the-dark plant that raised
US$484,013 (see Nature 498,
15-16; 2013) and an asteroid-
hunting telescope called Arkyd
that raised $1.5 million. But an
analysis of 115 science appeals
shows that the median amount
raised is about $3,000, with mean
donations of $60-100. The data
were collected by Ethan Perlstein,
an evolutionary pharmacologist
in Oakland, California (see
go.nature.com/3fembu).

SOURCE: ETHAN PERLSTEIN


Climate politics 
Republican Kenneth Cuccinelli 
lost his bid on 5 November to 
become the next governor of 
Virginia, in an election that 
highlighted scientific issues. 
As the state’s attorney-general, 
in 2010 Cuccinelli launched 
an investigation of climate 
scientist Michael Mann. 
Virginia’s Supreme Court 
later ruled that he lacked the 
authority to demand Mann's 
grant applications, e-mails 
and other documents. During 
the gubernatorial campaign, 
Cuccinelli’s opponent, 
Democrat Terry McAuliffe, 
called him a climate-change
denier and accused him of
running a ‘witch-hunt’ 
against Mann. 


| RESEARCH 
Genome access 


An effort to encourage 
people to make their genome 
sequences and medical 
histories public has expanded 
from the United States to the 
United Kingdom. George 
Church, a genomicist at 
Harvard Medical School
in Boston, Massachusetts,
who launched the Personal 
Genome Project in 2007, and 
Stephan Beck, a genomicist at 
University College London, 
announced the UK arm on
6 November; a European
franchise is on the way in
2014. See go.nature.com/izmgpo
for more.


Greenhouse gases


Atmospheric greenhouse-
gas concentrations reached a
record high in 2012. According
to a 6 November report by
the World Meteorological
Organization (WMO) in
Geneva, Switzerland, carbon
dioxide levels climbed last
year to an average 393.1 parts
per million (p.p.m.), 141%
of pre-industrial levels.
Long-lived greenhouse
gases, including CO₂,
methane, nitrous oxide and
chlorofluorocarbons, rose
collectively by 2.6 p.p.m.
to an equivalent CO₂
concentration of 475.6 p.p.m.
The observations come from
the WMO’s global monitoring
network, which last May
measured a record daily
concentration of atmospheric
CO₂ at Mauna Loa, Hawaii (see
Nature 497, 13-14; 2013).


THE SCIENCE OF CROWDFUNDING


On four popular crowdfunding websites, 115 successful science
appeals collected more than US$5 million from 2011 to 2013.


18 NOVEMBER
NASA’s MAVEN
mission to Mars is
scheduled to launch. See
page 178 for more.


19-21 NOVEMBER
In Paris, the
International Energy
Agency holds its
biennial meeting to set
new strategic priorities.
go.nature.com/qrimeg


Biology preprints 


Life scientists have a new 
website for sharing research 
papers before journal 
publication. The site, called 
bioRxiv, was started by Cold 
Spring Harbor Laboratory 
Press in New York, and went 
live on 11 November. Unlike 
the arXiv.org preprint server 
popular among physicists, 
the site will allow readers to 
comment on articles, and will 
include sections devoted to 
different subdisciplines of 
biology. See page 180 for more. 


| BUSINESS 
Rare-disease deal 


Pharmaceutical giant Shire, 
headquartered in Dublin, 
announced on 11 November 
the purchase of drug 
company ViroPharma
for US$4.2 billion. Based
in Exton, Pennsylvania,
ViroPharma develops drugs 
for rare diseases. The bulk of 
its roughly $400 million in 
annual sales revenue comes 
from a drug used to treat a 
rare blood condition known as 
hereditary angioedema. 
A sample of the thousands of protein structures solved with the help of a US National Institutes of Health programme that has now been closed down.


Large NIH projects cut 


Budget woes force institutes to scrutinize expensive, non-competitive programmes. 


BY SARA REARDON 


Big science is under big pressure at
the US National Institutes of Health
(NIH). Gone are the glory days of the 
early 2000s, when a doubling of the agency’s 
budget over five years allowed it to establish 
dozens of programmes with their own large, 
dedicated budgets. Since March, the manda- 
tory US government cuts known as sequestra- 
tion have sliced 5% off the NIH’s already-tight 
budget. Now, institute directors are assessing 
the future of these costly legacy programmes,
and shutting some of them down. Given the
climate, directors are wondering whether their 
limited money might be better spent funding 
competitive grant applications, more than 80% 
of which are currently denied. 
“In challenging fiscal times, it’s incumbent 
on us to take a step back and look at how we're
investing money,” says Jon Lorsch, who took
over the National Institute of General Medi- 
cal Sciences (NIGMS) in Bethesda, Maryland, 
in August. He says that the budget crunch has 
contributed to his decisions on what to cut, and 
that his institute’s priority should be to fund 
research proposed by investigators, rather than 
programmes designed by the NIH. 

Changes are already under way. In Sep- 
tember, the NIGMS announced that it will 
wind down the Protein Structure Initiative 
(PSI), which since 2000 has received about 
US$70 million per year in dedicated funds 
to develop ways to crystallize proteins and 
predict their structures. And on 1 November, 
the institute announced that its Pharmaco- 
genomics Research Network, which studies 
how genes influence the effectiveness of drugs, 
will lose the protected budget that has brought 
it $404 million over its 13-year lifetime. 


An advisory council to the NIGMS had 
concluded that the PSI was successful but had 
run its course. And the field of pharmaco- 
genomics, which was new when the research 
network began in 2000, has matured enough 
for its scientists to compete effectively with 
grant applicants in other fields, says Lorsch. 
“Now it’s time to let those things fly and be free.”

For investigators who feel that the NIH 
should hand out more small grants, the 
NIGMS’s example is welcome. The PSI, in 
particular, has been controversial. In its first 
phase, the money was distributed between nine 
structural-biology research consortia, each of 
which would go on to produce dozens of pro- 
tein structures per year — with the lofty goal 
of eventually predicting structures for most of 
the amino-acid sequences possible in nature. 

Steven Almo, a crystallographer at the 
Albert Einstein College of Medicine in New
York who leads one of these consortia, says
that the programme sped up the development 
of technologies such as robotic crystalliza- 
tion systems and automated high-throughput 
structure screening. He adds that all structural 
biology benefits from these advances, as well 
as from the thousands of solved protein struc- 
tures that have been deposited in the PSI’s 
Structural Biology Knowledgebase (Nature 
Publishing Group partners with the NIGMS 
to maintain this database). 

But many of the structures produced by the 
PSI, which were based on theoretical models, 
turned out to be irrelevant to biological func- 
tions, counters Gregory Petsko, a crystallog- 
rapher at Brandeis University in Waltham, 
Massachusetts. If the PSI’s budget had instead 
funded a few hundred individual investigators, 
each competing to work on specific structures, 
they probably would have come up with more 
relevant proteins, he says. He is thrilled that 
the programme is shutting down. “Put a stake
through its heart, bury it in a coffin filled with 
its native soil — do whatever you can do to 
keep it from rising again,” he says. 

Partly in response to such criticisms, the PSI 
changed its direction in 2010, starting a new 
phase: PSI:Biology, which directed four centres 
to work on structures that biologists believe to 
be particularly important. Almo’s group, for 
instance, is now investigating proteins involved 
in the immune system, to look for molecules 
that could be useful as drugs or drug targets. He 
is disappointed that the PSI is shutting down just
as the new phase is becoming productive. “We
have a unique resource and we're in the position
to start applying this in a powerful way,” he says.

But Stefano Bertuzzi, executive director of the 
American Society for Cell Biology in Bethesda, 
says that the NIGMS has made the right deci- 
sion. He thinks that it is better to make strategic 
cuts that eliminate entire programmes than to 
trim all projects and debilitate everyone. 

That approach also suits Story Landis, director
of the National Institute of Neurological Disorders
and Stroke (NINDS) in Bethesda, which has already
instigated such changes. In 2009, NINDS
could guarantee funding for only the top 9%
of grant applications it received. So, Landis 
says, the institute began taking a close look 
at big, long-running legacy projects that had 
begun around 2000, and ended about 15 that 
were unproductive or outdated. This has saved 
the institute some $30 million in the past four 
years. Ending ineffective clinical trials saved 
another $92 million. NINDS now guarantees 
funding for 14% of grant applications. 

“The issue wasn't big science, it’s more 
making sure that we manage our dollars effec- 
tively,” says Landis. Indeed, NINDS will not 
be abandoning big projects any time soon: the 
controversial BRAIN Initiative, which aims to 
develop technologies to map neural circuits,
could draw $40 million in NIH money in 2014
— much of it from NINDS. 

Still, a tight budget does seem to be enforc- 
ing higher standards among big projects being 
started across the NIH (see D. Sarewitz Nature 
502, 595; 2013). That is as it should be, says for- 
mer NIGMS director Jeremy Berg, now director 
of the Institute of Personalized Medicine at the 
University of Pittsburgh in Pennsylvania. Berg 
does not believe that more money necessarily 
produces better science: in 2010, he performed 
an analysis which found that middle-sized labs 
with moderate funding are the most productive 
(see Nature 468, 356-357; 2010). 

Yet Berg does not want big science to disap- 
pear. There will always be problems that can be 
solved only by large initiatives, he says — such 
projects just have to have clear and relevant 
goals. “My strong feeling is that the NIH’s goal 
is to try to facilitate the NIH mission, which is 
improving the nation’s health,” he says. “It's not 
a jobs programme for scientists.”

Lorsch says that more changes can be 
expected at the NIGMS in the coming year, but 
he stresses that it will not be a battle between big
science and little labs. “People set up a dichot- 
omy of investigator-initiated projects versus 
team science, but that’s not the contrast we’re
making,” he says. And there could be room for 
growth in a medium-sized category. Lorsch 
says that the NIGMS is looking at new funding 
mechanisms, such as grants with multiple principal
investigators and ‘supergroups’ composed
of scientists from different fields.


CLIMATE CHANGE 


Warsaw talks to thrash out 
UN climate roadmap 


Costs of reducing emissions may be flashpoints in path towards 2015 Paris treaty. 


BY JEFF TOLLEFSON 


At a major United Nations climate
summit in Warsaw this week, a plan is
being hammered out for negotiations
on a new climate treaty to be finalized in Paris
in two years’ time. Delegates from 195 nations
are also seeking to obtain commitments from 
countries to limit their greenhouse-gas emis- 
sions between now and 2020. But the path 
forward is rife with disputes between rich and 
poor countries over funding, and how to allo- 
cate and enforce emissions reductions. 

The conference aims to outline the schedule 
and to set parameters for negotiations ahead of 
the next major climate summit in Paris in 2015, 
when countries hope to forge a treaty to follow
the 2009 agreement settled on in Copenhagen.

At that meeting, negotiations over a formal 
treaty broke down, but eventually resulted in 
a set of non-binding pledges — the Copenha- 
gen Accord — for emissions reductions until 
2020. The accord also blurred the distinction 
between developed countries, which were 
bound by the 1997 Kyoto Protocol to reduce 
emissions, and developing countries, which 
had no such obligations. Since then, nego- 
tiators have worked on how to structure a new 
framework that would involve climate commit- 
ments from all countries — including China, 
now the world’s largest emitter, and the United 
States, which never ratified the Kyoto Protocol 
(E. Diringer Nature 501, 307-309; 2013). 

The Warsaw talks are split into two main 
tracks. One focuses on the architecture of a
new global climate treaty that would take effect 
after 2020, when the current Copenhagen 
commitments expire. The second examines 
what can be done to strengthen commitments 
between now and 2020 to increase the chance 
of limiting global warming to a target of 2°C 
above pre-industrial temperatures (see ‘Emissions
up in the air?’).

The European Union (EU), for example, 
has proposed a multi-stage process, whereby 
commitments for climate action post-2020 
would be registered next year and then sub- 
jected to an international assessment to deter- 
mine how well the commitments measure 
up against each other and against scientific 
assessments. The final commitments would 
then be registered in Paris in 2015. By getting
countries to volunteer their climate commit-
ments and comparing them in this way, the
hope is that nations with unambitious targets
might be shamed into strengthening them.
The EU has also called for a review of pre-2020
commitments.

SOURCE: PBL NETHERLANDS ENVIRON. ASSESSMENT AGENCY/UNEP/CLIMATE POLICY INITIATIVE

Tasneem Essop, who is tracking negotiations 
for the environmental group WWF in Cape
Town, South Africa, says that these short- 
term commitments are crucial for pointing 
the world in the right direction. “The biggest 
challenge will be to ensure that emissions do 
peak within this decade,” she says. 

The cost of reducing emissions could be 
the first flashpoint in Warsaw. In Copenha- 
gen, developed countries agreed to provide 
US$30 billion in climate aid from 2010 to 2012, 
and to increase climate support to developing 
countries to $100 billion annually by 2020. 
Although the short-term commitments were 
largely met, there is no clear plan for attaining 
the goal of $100 billion a year. From emerging 
giants such as Brazil and China to poor coun- 
tries in Africa, developing nations are demand- 
ing that wealthy countries ramp up funding 
and create a viable path to this goal. 

With public coffers strapped, many devel- 
oped nations are looking for other funding 
sources. One possibility is to place some type 
of levy on international aviation, which is being 
considered by the International Civil Aviation 
Organization in Quebec, Canada. The body has 
committed to craft an agreement by 2016 that 
could take effect by 2020. 

Negotiators in Warsaw will haggle over 
how to finance and ultimately deploy climate 
aid through organizations such as the newly 
launched Green Climate Fund, based in 
Incheon, South Korea. Another flashpoint is 
the developing countries’ demand for a ‘loss and 
damage’ mechanism to compensate poor coun- 
tries irreparably harmed by climate change. 

But the biggest questions will centre on the 
framework for the treaty in 2015. Before Copen- 
hagen, the emphasis was on a treaty similar to 
the Kyoto Protocol that would lock in legally 
binding emissions reductions. In Copenhagen, 
the United States and other developed coun- 
tries pushed for an alternative that would allow 
individual countries to register commitments, 
which would then be reviewed at an interna- 
tional level. Delia Villagrasa, a senior adviser for 
the European Climate Foundation in Brussels, 
says that the talks are moving towards this bot- 
tom-up approach, which would be combined 
with a formal review to assess commitments 
and identify ways to scale them up. The world 
could get its first hint of what such a system 
might look like as the talks wrap up next week. 

“Warsaw will bring some clarification on 
the structure of the new agreement,” Villa- 
grasa says. “That's not sexy for the media, but 
it’s important.”


Additional reporting by Quirin Schiermeier 
EMISSIONS UP IN THE AIR?


Countries will discuss how to reduce greenhouse-gas emissions at this week’s climate summit in 
Warsaw. Current commitments expire in 2020 and fall short of climate goals. 


After rising by about 3% annually in the past decade, global emissions of carbon dioxide increased 
by just 1.1% in 2012. 


In 1992, the United Nations Earth 
Developing countries, led by China, now emit more greenhouse gases than developed countries,
although their per-capita emissions are much lower.

Various trajectories for global emissions are depicted according to their resulting temperature rises
by 2100. Current commitments to reduce emissions by 2020 are not sufficient to limit warming to 2 °C.

Global climate investments by governments, businesses and consumers topped an estimated
US$359 billion in 2012. Some $39 billion to $62 billion, including $4 billion to $11 billion in
government aid, moved from wealthy to poor countries.


OECD
US$218 bn

40% of all climate finance spent by countries in the
Organisation for Economic Co-operation and
Development (OECD) entirely within their own borders.

9% flows to OECD countries from international
sources, largely from other developed nations.


NON-OECD
US$141 bn

15% flows to non-OECD countries from international
sources; OECD-to-non-OECD flows comprise 12%
of all climate finance.

36% of climate finance spent by non-OECD countries
entirely within their own borders.
ANNA PANTELIA/CERN


Discoveries at the Large Hadron Collider, now being upgraded, will inform designs for the next accelerator. 


PARTICLE PHYSICS 


Physicists plan to 
build a bigger LHC 


Accelerator ring would be 100 kilometres around and run at 
seven times the energy of the Large Hadron Collider. 


BY EUGENIE SAMUEL REICH 


When Europe's Large Hadron Collider
(LHC) started up in 2008, particle
physicists would not have dreamt of
asking for something bigger until they got their 
US$5-billion machine to work. But with the 
2012 discovery of the Higgs boson, the LHC 
has fulfilled its original promise — and physi- 
cists are beginning to get excited about design- 
ing a machine that might one day succeed it: 
the Very Large Hadron Collider (VLHC). 

“It’s only prudent to try to sketch a vision 
decades into the future,” says Michael Peskin, 
a theoretical physicist at SLAC National Accel- 
erator Laboratory in Menlo Park, California, 
who presented the VLHC concept to a US gov- 
ernment advisory panel on 2 November. 

The giant machine would dwarf all of 
its predecessors (see ‘Lord of the rings’). It 
would collide protons at energies around 
100 teraelectronvolts (TeV), compared with 
the planned 14 TeV of the LHC at CERN, 
Europe’s particle-physics lab near Geneva in
Switzerland. And it would require a tunnel 
80-100 kilometres around, compared with the 
LHC’s 27-km circumference. For the past dec- 
ade or so, there has been little research money 
available worldwide to develop the concept. 
But this summer, at the Snowmass meeting in
Minneapolis, Minnesota — where hundreds 
of particle physicists assembled to dream up 
machines for their field’s long-term future — 
the VLHC concept stood out as a favourite. 
Some physicists caution that the VLHC 
would be only a small part of the global par- 
ticle-physics agenda. Other priorities include: 
upgrading the LHC, which shut down in Feb- 
ruary for two years to boost its energies from 
7 TeV to 14 TeV; plans to build an International 
Linear Collider in Japan, to collide beams of 
electrons and positrons as a complement to the 
LHC’s proton findings; and a major US pro- 
ject to exploit high-intensity neutrino beams 
generated at the Fermi National Accelerator
Laboratory in Batavia, Illinois. Jonathan
Rosner, a particle physicist at the University
of Chicago, Illinois, who convened Snowmass,
says that these forthcoming projects should
be the focus. “It’s premature to highlight the
VLHC,” he says.


LORD OF THE RINGS


Physicists are discussing a proton-colliding machine
that would dwarf the energy of its predecessors.


Very Large Hadron Collider (suggested)
Circumference: 100 km
Energy: 100 TeV*


Large Hadron Collider
Circumference: 27 km
Energy: 14 TeV


Tevatron (closed)
Circumference: 6.3 km
Energy: 2 TeV


*TeV, teraelectronvolt.

In some ways, the interest in the VLHC is 
a sign that particle physicists are returning to 
their roots, pushing to ever higher energies to 
find the fundamental building blocks of nature. 

They will have to justify it, however. The 
discovery of the Higgs particle lends sup- 
port to the idea that some particles have mass 
because they interact with a pervasive, trea- 
cle-like Higgs field. Yet many aspects of the 
discovery are still not understood, including 
why the mass of the Higgs particle is so large. 
One way of explaining its heaviness is through 
supersymmetry theory, in which known 
particles are coupled with heavier ones that 
might be observed in bigger particle collid- 
ers. Although the LHC has not detected any 
signs of supersymmetry, Peskin hopes that a 
hint may come before the end of the decade, 
which would help to inform the design of a 
larger machine. 

One advocate of a bigger machine is Nima 
Arkani-Hamed, a theoretical physicist at the 
Institute for Advanced Study in Princeton, 
New Jersey. In December, he will help to 
launch an institute in Beijing called the Center 
for Future High Energy Physics. Part of its 
explicit mission, he says, is to explore the phys- 
ics that a future proton collider might investi- 
gate. William Barletta, an accelerator physicist 
at the Massachusetts Institute of Technology 
in Cambridge, says that this work is crucial to 
identify a machine size that will maximize the 
science per dollar. “We won't just give hand- 
waving arguments,” he says. 

To build a 100-TeV machine, Barletta adds, 
physicists will need to develop supercon- 
ducting magnets that can operate at higher 
fields than the current generation, perhaps 
20 tesla instead of 14 tesla. One leading can- 
didate material for such magnets is niobium 
tin, which can withstand higher fields but is 
expensive and must be cooled below 18 kelvin. 
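
The link between ring size and magnet strength can be made concrete with the standard bending relation for a proton ring: beam momentum in TeV/c is roughly 0.3 times the dipole field in tesla times the bending radius in kilometres. The sketch below is a rough illustration, not a design calculation. The function name and the fill factor of 0.66 (the share of the ring's average radius that acts as bending radius, roughly the LHC's value) are assumptions made here, not figures from the study.

```python
import math

# Assumed fill factor: bending radius / average ring radius.
# Roughly the LHC's ratio; a hypothetical value for a future ring.
ASSUMED_FILL_FACTOR = 0.66


def required_dipole_field_tesla(collision_energy_tev, circumference_km,
                                fill_factor=ASSUMED_FILL_FACTOR):
    """Estimate the dipole field needed to hold protons in the ring.

    Uses p [TeV/c] ~= 0.2998 * B [T] * rho [km], with each beam carrying
    half of the collision energy.
    """
    beam_energy_tev = collision_energy_tev / 2.0
    average_radius_km = circumference_km / (2.0 * math.pi)
    bending_radius_km = fill_factor * average_radius_km
    return beam_energy_tev / (0.2998 * bending_radius_km)


if __name__ == "__main__":
    for circumference_km in (80, 100):
        field = required_dipole_field_tesla(100, circumference_km)
        print(f"{circumference_km} km ring, 100 TeV: about {field:.1f} T")
```

Under these assumptions, an 80-km ring needs a field close to the 20 tesla that Barletta mentions, and a 100-km ring somewhat less, which is one reason the choice of tunnel size and magnet technology go together.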

CERN is developing its own plans for a 
collider that is similar to the VLHC. CERN 
accelerator physicist Michael Benedikt is 
leading a study of a ‘very high energy large 
hadron collider’ that would pass under Lake 
Geneva. It would have the same key param- 
eters as the suggested VLHC: a circumfer- 
ence of 80-100 km and a collision energy of 
100 TeV. Benedikt suggests that construction 
might begin in the 2020s so that the machine 
could be completed soon after the LHC shuts 
down for good around 2035. “One would 
not want to end up with a huge gap for high-energy
physics,” he says. He adds that it is too
early to offer a price tag. But other physicists 
speculate that a next-generation collider 
would have to cost less than $10 billion for 
the project to be politically plausible.
NASA’s MAVEN spacecraft will measure the effect of solar storms on Mars’ thin outer atmosphere.


PLANETARY SCIENCE 


Mars mission set 
for launch 


NASA’s MAVEN orbiter aims to unravel the mystery of the 
red planet’s missing atmosphere. 


BY ALEXANDRA WITZE 


Just before he submitted a proposal for a
mission to Mars to NASA, Bruce Jakosky
checked to make sure the word ‘aeronomy’
hadn’t crept into the document. Yes, his satel-
lite would be studying the planet's wispy upper
atmosphere, but he knew that NASA would be
more interested in the links between the van-
ishing air and the planet's lost surface water.
“We pitched it not as an aeronomy mission, but
as a mission to understand the geology,” says
Jakosky, a planetary geologist at the University
of Colorado Boulder.

The strategy worked. In 2008, NASA gave
the Mars Atmosphere and Volatile Evolu-
tion (MAVEN) mission the go-ahead. The
US$485-million spacecraft is now at Florida’s
Cape Canaveral, waiting for a 28-day launch
window that opens on 18 November. If
MAVEN successfully reaches Mars ten months
later, it will study how gases in the outer atmos-
phere escape into space — and how that rate of
loss has changed over the past few billion years.

MAVEN comes at a crucial time in NASA’s
Mars programme. The spacecraft is the only
US orbiter planned for the foreseeable future,
and it carries the long-term burden of relay- 
ing data from NASA rovers on the surface to 
mission controllers on Earth. The two operat- 
ing US orbiters, Mars Odyssey and the Mars 
Reconnaissance Orbiter, were launched in 
2001 and 2005, respectively. Of the three, 
MAVEN is the most likely to be working by 
the time NASA’s next rover heads to Mars in
2020. The MAVEN project was spared from 
last month's US government shutdown because 
of this crucial relay ability. 

Upon arrival, MAVEN will explore how 
the solar wind interacts with Mars’s upper 
atmosphere, helping to erode it. “MAVEN’s 
job is to obtain some key constraints that will 
allow us to make much more solid arguments 
about how Mars got to be what it is,” says Janet 
Luhmann, a space physicist at the University of 
California, Berkeley, and the mission’s deputy 
principal investigator. 

Canyons carved into the landscape and 
chemical evidence from rocks indicate that 
Mars once had a thick atmosphere that kept 
the surface warm enough for water to flow. 
But most of that atmosphere is gone; today it 
is only about 1% as dense as that of Earth. Mars 
is so small that the gravitational grip it has on 
its atmosphere is weak. And when its protective
global magnetic field switched off, around
4 billion years ago, the planet's hold on its outer 
atmosphere weakened further. 

MAVEN’s instruments will explore several
of the many ways that atoms in the atmos-
phere can escape. An ultraviolet spectrograph
will measure major components of the atmos-
phere, including the ratio of deuterium (or
‘heavy hydrogen’) to hydrogen. The Martian
atmosphere is enriched in deuterium because, 
unlike lighter-weight hydrogen, the isotope 
had insufficient thermal energy to break free of 
the planet's gravitational pull. Comparing the 
ratio of deuterium to hydrogen in the atmos- 
phere with the ratio locked in ancient rocks at 
the planet’s surface should reveal the rate of 
this particular type of loss over time. 

Other missions to Mars have studied 
another type of loss: the depletion of charged 
particles. In 1988, the Soviet Phobos-2 mission 
made the first detailed measurements showing 
that the solar wind stripped ions from Mars’s 
atmosphere. By 2007, the European Space 
Agency’s Mars Express orbiter had discovered 
that several grams of oxygen and carbon diox- 
ide ions were being lost each second (S. Bara- 
bash et al. Science 315, 501-503; 2007). 

Unlike these missions, MAVEN will be 
able to study a third type of escape: the ero- 
sion of neutral particles, such as hot oxygen 
atoms, that form in the upper atmosphere and 
escape because of their high energies. In addi- 
tion, MAVEN will be able to study changes in 
extreme ultraviolet light flooding from the 
Sun, and watch for differences in the solar 
wind caused by solar storms, to measure the 
effect of solar activity on atmospheric losses. 
Together, these measurements should paint a 
more detailed picture of atmospheric escape 
than ever before, says Jakosky. 

Mars Express will also continue to con- 
tribute. Its researchers and those working on 
MAVEN are planning to make simultaneous 
observations with the two spacecraft. When 
both are in place, their elliptical paths will take 
them from a few hundred kilometres above 
the planet’s surface to thousands of kilome- 
tres away. While one of the probes is deep in 
the atmosphere, the other will have a broader 
perspective from afar. “Having two spacecraft 
capable of focusing on the same subject is very 
exciting,” says Stas Barabash at the Swedish 
Institute of Space Physics in Kiruna, a member 
of the Mars Express science team. 

Mars Express has funds to operate until 
the end of 2016. India’s first Mars mission, a 
methane-sniffing orbiter that launched on 
5 November, will also overlap with MAVEN: 
both spacecraft are due to arrive at Mars in the 
same month. 

MAVEN has a one-year nominal mission in 
which to accomplish its goals. But as it fulfils its 
longer-term communications tasks through the 
years, it may be well placed to watch the Martian 
atmosphere eroding before its electronic eyes.
Social scientists hit
back at grant rules 


Researchers seek to fend off restrictions on National Science 
Foundation grant programmes. 


BY SARAH ZHANG 


It is not unusual for conservative politicians
in the United States to question the value of
social-science research. Studies of anything
from global social networks to the history of
conservation in South America have proved
irresistible to Republicans keen to argue that
funding would reap greater rewards elsewhere.
But this year, researchers in the field received
a sharp shock when those criticisms morphed
into tangible restrictions.

“What’s different this time is they suc-
ceeded,” says Howard Silver, executive director
of the Consortium of Social Science Associa-
tions. In March, Congress placed new limits
on political-science research funded by the
National Science Foundation (NSF). And
with powerful Republicans pushing to enact
legislation that would restrict the NSF further,
forcing it to support only research that serves
the ‘national interest’, newly emboldened social
scientists are gearing up for a fight.

Silver estimates that he has made about
25 visits to Capitol Hill since March to try to
ensure that the NSF is still able to support a
wide range of social, behavioural and eco-
nomic (SBE) research. And SAGE Publica-
tions, a major publisher of social-science
journals based in Thousand Oaks, California,
has this year poured some US$90,000 into lob-
bying for sustained NSF funding.

Researchers say that the March bill is already
biting. The law includes a provision inserted by
Senator Tom Coburn (Republican, Oklahoma)
requiring the NSF to certify that all political-
science projects it supports will benefit national
security or US economic interests. In July, the
NSF made a last-minute decision to scrap one
of its two annual calls for political-science
grants. The agency has not explained why, but
many researchers blame Coburn’s amendment.

“This is catastrophic intervention in peer
review,” says Jane Mansbridge, a political sci-
entist at Harvard University’s John F. Kennedy
School of Government in Cambridge, Massa-
chusetts, who studies the democratic process.
Mansbridge adds that she will not apply for
funding in the NSF’s next grant call, in January,
because she does not think that her proposal
on the dynamics of negotiations will meet the
Coburn amendment’s criteria.

Coburn’s success seems to have heartened 
other social-science critics in Congress. In 
April, Representative Lamar Smith (Republi- 
can, Texas), the chairman of the House Com- 
mittee on Science, Space, and Technology, 
asked the NSF for details of five social-science 
grants, citing concerns that they did not satisfy 
the agency’s standard for ‘intellectual merit’.
And Smith’s committee is this week holding 
a hearing to discuss legislation he is backing 
that would require the NSF to certify that all 
the grants it awards serve the national interest 
in one of six areas: economic competitiveness, 
health and welfare, scientific literacy, partner- 
ships between academia and industry, promo- 
tion of scientific progress or national defence. 

It is not just the idea of national interest that 
is concerning lobbyists. Smith’s bill does not 
include budget numbers, normally the centrepiece
of authorization legislation. But it would
prevent NSF divisions other than the
SBE directorate from funding social-science
research unless it is of ‘higher priority’
than all other research. That has left many
wondering whether Republicans ultimately 
plan to cut the SBE directorate’s budget. 

Others say that the requirement to serve 
the national interest could steer the NSF away 
from basic research. “They’re sending a message
to the NSF: ‘We want you to change your
mission’,” says Michael Lubell, director of public
affairs for the American Physical Society.

In their visits to Capitol Hill, lobbyists are 
stressing to lawmakers and their staff the far- 
reaching effects of NSF-funded research. They 
cite, for example, an algorithm that is now used 
to match donated kidneys to transplant patients. 

But David Stonner, who directed the NSF's 
congressional-affairs office from 1991 to 
2007, worries that social scientists are taking 
the wrong approach. They need to highlight 
private industries, such as social-media com- 
panies, where the economic benefits are real- 
ized, he says. Others worry that such a tack 
will only reinforce prejudice against basic 
research. Myron Gutmann, former head of the 
NSF's social-science programmes, says simply: 
“The NSF is not the ‘National Industry Support 
Foundation’.”


Preprints come to life


A dedicated website for sharing biology papers before peer review leaves journals divided. 


BY EWEN CALLAWAY 


What are biologists so afraid of?
Physicists, mathematicians and 
social scientists routinely post their
research to preprint servers such as arXiv.org
before publication, yet few life scientists 
follow suit. 

A website that goes live this week is hoping 
to change that. The site, bioRxiv.org, launched 
by Cold Spring Harbor Laboratory Press in 
New York, bills itself as “the preprint server for 
biology”. It will operate similarly to arXiv, with
scientists depositing papers as soon as they are 
ready to share them, weeks or months before 
formal publication. 

The site’s launch comes as biologists begin 
to embrace preprints (see Nature 488, 19; 
2012). ArXiv’s quantitative biology section, 
launched in 2003, has seen exponential gains 
in the past two years (see ‘Life sciences freed’). 
It is on target to attract more than 1,500 
manuscripts this year, says Paul Ginsparg, a 
physicist at Cornell University in Ithaca, New 
York, who started arXiv in 1991 and serves 
on bioRxiv’s advisory board. Meanwhile, the 
open-access journal PeerJ launched its own
biology preprint server in April and has drawn 
some 90 submissions so far. 

Geneticists in particular have flocked to 
arXiv’s quantitative-biology repository, and 
papers posted there on topics such as the 
genetic ancestry of southern Africans and Euro- 
peans have gone on to be published in promi- 
nent journals like Nature Communications and 
PLoS Biology. Scientists studying infectious dis- 
eases have also used the site to rapidly report on 
ongoing outbreaks, such as the H7N9 influenza 
infections in China in early 2013. 

But arXiv does not plan to expand into 
other areas of the life sciences, such as cell and 
molecular biology, and the site is not designed 
with biologists in mind. Richard Sever, who 
founded bioRxiv with colleague John Inglis at 
Cold Spring Harbor Laboratory Press, says: 
“This is not something that is competing with
arXiv. It is supposed to complement it.”

BioRxiv, which operates in conjunction
with HighWire Press in Stanford, California,
is intentionally minimalist. “It’s designed to be
basic and functional because it’s a mechanism
for sharing manuscripts,” says Sever.

Papers — there are nine already, most of 
which had previously been posted to arXiv 
— will be grouped into some two dozen sub- 
disciplines, including cancer biology and 
immunology. Individual papers come with 
unique digital object identifiers (DOIs), mak- 
ing them easy to cite, and are described as new 
results, confirmatory results or contradictory 
results. Researchers can discuss papers in com- 
ments that appear alongside articles; these will 
be moderated. 

At present, Cold Spring Harbor Labora- 
tory is paying for the site, but Sever and Inglis 
declined to disclose the cost of the site’s launch 
or its annual budget. They will look for ongo- 
ing sources of revenue, such as foundation 
grants, and plan to keep the site free to users. 
ArXiv’s annual budget is around US$750,000, 
a large proportion of which comes from the
Simons Foundation, a non-profit organization
in New York, with universities and others mak- 
ing smaller contributions. 

Leonid Kruglyak, a geneticist at the Univer- 
sity of California, Los Angeles, and a member 
of bioRxiv’s advisory board, expects many col- 
leagues to adopt the site. Barriers that previ- 
ously prevented biologists from using preprint 
servers seem to be eroding, he says. Most quan- 
titative geneticists are no longer afraid of get- 
ting scooped by posting preprints of their work 
to arXiv, with many now seeing the process as a 
way to establish priority for their work. 

Many journals, such as those published by 
the Genetics Society of America and the Eco- 
logical Society of America, have changed their 
policies in recent months to allow the practice. 
Nature Publishing Group journals allow pre- 
prints, and Science accepts them in many cases. 
Cell Press, which publishes biology stalwarts 
such as Cell and Neuron, discourages preprint 
posting. But “you can easily imagine a situa- 
tion not too long from now, where having your 
journal not be preprint-friendly is an active 
disincentive to publishing there”, Kruglyak says.

However, Martin Fenner, a haematologist at 
Hanover Medical School in Germany, thinks 
that many life scientists will be more comfort- 
able with open peer review, which is avail- 
able from journals such as F1000Research. In 
open peer review, publishers post submitted 
draft manuscripts to a journal’s website before 
acceptance. Compared with preprints posted 
to a third party such as bioRxiv, this process
results in a shorter lag time between first draft 
and final publication, and the different ver- 
sions are clearly linked, Fenner says. 

Graham Coop, an evolutionary geneticist 
at the University of California, Davis, posts 
his lab’s papers to arXiv and has already cross-
posted one to bioRxiv. Peer pressure will
prompt researchers in other fields to follow suit,
he says. “If people see papers from people they
know appearing on sites like the bioRxiv, they
naturally feel more comfortable.”
Africa’s circumcision challenge


To combat the spread of HIV, health officials plan to circumcise 20 million 
men in Africa, but some have concerns about the aftermath. 


BY CATHERINE DE LANGE 


Marvin is 22 years old, single and — like many men of his age
— thinking about girlfriends. But for the next six weeks, he
will have to give his love life a rest. On this sunny morning
in the Zambian capital, Lusaka, Marvin is waiting to get
circumcised. Along with three of his friends, he sits tensely on a bench 
outside the operating theatre, a room usually used for abortions. Are 
they nervous? Marvin cracks a wide smile. “A little,” he says. 

In Lusaka, young men like Marvin have grown up with daily remind- 
ers of the risks that can accompany sexual relationships. For years, bill- 
boards and adverts painted on walls have espoused the importance of 
safe sex and testing for HIV, which currently infects more than one in 
five Lusaka residents. But more recently, a new message has been pop- 
ping up in public spaces and the media: a call for men to get circumcised 
to reduce their chances of contracting the virus. Marvin's friends told 
him about the procedure. “A lot of them have been circumcised so I 
thought of doing the same thing,” he says, adding that it may increase 
his odds with women. “I hear that the first thing they ask you is ‘have 
you been circumcised?’” he says. But, ultimately, he hopes the surgery
will help him “just to stay healthy”. 

Scenes like this have become familiar across much of southern and 
eastern Africa. Since 2007, 14 nations have taken part in a massive public- 
health campaign aimed at circumcising millions of men in an attempt 
to drastically reduce the spread of HIV (see ‘Making the cut’). About 
3 million men in the region have been circumcised since the start of the 
campaign, and the initiative was made a high priority in late 2011 by 
the World Health Organization (WHO), the Joint United Nations Pro- 
gramme on HIV/AIDS (UNAIDS) and the US President’s Emergency 
Plan for AIDS Relief (PEPFAR) — which are funding part of the pro- 
gramme. The rest is being provided by the Bill & Melinda Gates Founda- 
tion, the World Bank and other global health organizations. The targets 
are ambitious: 80% of men of reproductive age in these countries need to 
be circumcised by 2015. That means more than 20 million circumcisions. 

If these efforts succeed, the payoff could be considerable. Reaching the 
80% goal could cut the number of new HIV infections in the target coun- 
tries by as much as half, says Bertran Auvert, a public-health specialist at 
the University of Versailles Saint-Quentin-en-Yvelines in Paris. “The goal
is to have a huge impact on the overall HIV epidemic in Africa.”

But some scientists worry that the benefits reported for circumcision 
in clinical trials will not bear out to the same
degree when scaled up to tackle a messy epi- 
demic that is spread as much through behaviour 
as biology. In particular, they say that men are getting mixed messages 
about the benefits of circumcision. Another concern is the effect on 
women, who gain no direct protection from the circumcision campaign, 
and may even face greater risk. Critics say the programmes could increase 
the risk of HIV infection in some populations by encouraging people to 
engage in risky sexual behaviours, such as forgoing condoms. 

“It's only been shown to decrease to some degree transmission from 
the female to the male, and yet we are acting as if it’s the only thing to do,” 
says Philip Thuma, a doctor who runs the Malaria Institute at Macha 
in southern Zambia. “Our job as scientists is to step back and be a bit 
sceptical. Should we really be putting this much emphasis on something 
at the expense of other things that are just as important?” 


Circumcision packets
at a clinic in Zambia.


POINT OF ENTRY 

The idea that circumcision might reduce the risk of HIV infection is 
often traced back to Valiere Alcena, a physician now at the Albert Ein- 
stein College of Medicine in New York City, who noted in 1986 that 
the men of Haiti and Central Africa are usually uncircumcised and as 
a result often develop lesions on the foreskin that serve as a potential 
entry point for AIDS¹. The idea was cemented when a landmark study²
in 1989 showed that of some 400 Kenyan men who visited a group of 
prostitutes with high rates of HIV, those who were not circumcised had 
more than eight times the risk of becoming infected than those who 
were. Michel Garenne, a demographer at the Pasteur Institute in Paris, 
remembers being at a meeting when the paper was first presented, and 
it seemed to make sense. “I was working in Senegal, where there was 
very little AIDS and a lot of circumcision, so I became quite convinced 
that something was going on,” he says.

But others disagreed, arguing that some populations did not fit 
this pattern and that the correlation could be 
explained by other factors relating to the preva- 
lence of circumcision, such as religious practice. 
Garenne himself became sceptical. In 1994, he 
was working in South Africa, where he found a
huge HIV epidemic even though many men were
circumcised. “I changed my mind and became convinced that circumcision
will never help in controlling HIV/AIDS,” he says.

Those who did believe in the protective effect of circumcision called 
for a systematic analysis to settle the matter. Three randomized clinical 
trials were carried out in southern Africa between 2002 and 2007. One 
study³, led by Auvert, offered medical circumcision to men in a region of
South Africa and compared them with a non-circumcised group. After 
21 months, there were 20 cases of HIV in the circumcised group and 49 
among the uncircumcised men, equating to a 60% reduction in HIV risk. 
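
The roughly 60% figure can be checked with simple arithmetic if the two arms of the trial are taken to be of similar size and follow-up time. That equal-arms simplification is an assumption made here for illustration; a full trial analysis would normally work from time at risk rather than raw case counts. A minimal sketch, with a hypothetical function name:

```python
def relative_risk_reduction(cases_treated, cases_control,
                            n_treated=1.0, n_control=1.0):
    """Return 1 - (risk in treated arm / risk in control arm).

    With the default arm sizes of 1.0 the arms are assumed to be equal,
    so the result depends only on the ratio of case counts.
    """
    risk_treated = cases_treated / n_treated
    risk_control = cases_control / n_control
    return 1.0 - risk_treated / risk_control


if __name__ == "__main__":
    # Case counts quoted in the text: 20 circumcised, 49 uncircumcised.
    reduction = relative_risk_reduction(20, 49)
    print(f"Relative risk reduction: {reduction:.1%}")
```

Passing real arm sizes as `n_treated` and `n_control` would adjust the estimate if the groups differed in size.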

Two other trials, in Kenya⁴ and Uganda⁵, produced similar results. “It’s
extraordinary in any public-health measure that you get such consist- 
ency of results,” says Robert Bailey, an epidemiologist at the University 
of Illinois at Chicago, who led the Kenyan study. 

The WHO also found the consistency compelling and declared the 
results “an important landmark in the history of HIV prevention”. In
2007, the WHO, UNAIDS and PEPFAR recommended implementing 
voluntary medical male circumcision (VMMC) in 14 high-risk African 
countries. 


‘A MAN WHO CARES’ 

Zambia, with its high prevalence of HIV, low circumcision rates and 
mixed attitudes towards circumcision, is a microcosm of the broader 
campaign. During the past five years, it has seen the progressive roll- 
out of catchy, pro-circumcision slogans on posters, television and radio 
campaigns. Health-service providers have also developed outreach pro- 
grammes to spread the message. 

At the clinic where Marvin is waiting, visitors are greeted by an adver- 
tisement for its services: a silhouette of a man standing proudly with his 
hands hooked inside his belt and, above it, a slogan reading: “Male Cir- 
cumcision, a man who cares”. Clinics that perform the procedure hope 
the belt-hook stance will become an unspoken symbol, a way for men to 
implicitly communicate their circumcised status, which is increasingly 
viewed as an attractive attribute. 

If the campaigns can convince enough men to take part, the pro- 
gramme could significantly reduce HIV-transmission rates, accord- 
ing to a number of modelling studies. These project that circumcising 
80% of men aged 15-49 years old in targeted countries by 2015 would 
decrease the incidence of HIV by 30-50% over 10 years, amounting to 
some 3.4 million fewer new infections. Carrying out these 20.3 million 
circumcisions by 2015, plus an additional 8.4 million over the follow- 
ing 10 years, would cost about US$2 billion, but would potentially save 
$16.5 billion by 2025 owing to avoided treatment and care costs⁶.
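
The modelling figures above imply some useful unit costs. The sketch below simply divides the quoted totals; it assumes that the US$2 billion covers all 28.7 million procedures and that the 3.4 million averted infections and the $16.5 billion in savings refer to the same programme, which the text suggests but does not spell out. The variable and function names are illustrative.

```python
# Figures quoted in the text (US dollars and head counts).
PROGRAMME_COST_USD = 2.0e9
PROJECTED_SAVINGS_USD = 16.5e9
CIRCUMCISIONS_BY_2015 = 20.3e6
CIRCUMCISIONS_FOLLOWING_DECADE = 8.4e6
INFECTIONS_AVERTED = 3.4e6


def programme_unit_costs():
    """Derive per-unit figures from the quoted programme totals."""
    total_circumcisions = CIRCUMCISIONS_BY_2015 + CIRCUMCISIONS_FOLLOWING_DECADE
    return {
        "cost_per_circumcision": PROGRAMME_COST_USD / total_circumcisions,
        "cost_per_infection_averted": PROGRAMME_COST_USD / INFECTIONS_AVERTED,
        "circumcisions_per_infection_averted": total_circumcisions / INFECTIONS_AVERTED,
        "savings_per_dollar_spent": PROJECTED_SAVINGS_USD / PROGRAMME_COST_USD,
    }


if __name__ == "__main__":
    for name, value in programme_unit_costs().items():
        print(f"{name}: {value:,.2f}")
```

On these numbers, each procedure costs on the order of $70, and between eight and nine circumcisions are needed to avert one infection.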

There may be other benefits too. Before men undergo voluntary 
medical circumcision, many get an HIV test, which means more cases 
of HIV will be picked up, and thus more men will be referred for treat- 
ment. “I think circumcision is going to have a very significant impact 
because it’s being delivered as a package,” Bailey says. 

A big part of that package is a one-on-one counselling session run by 
people such as Jonathan Kabanda, a counsellor for the Society for Family 
Health in Livingstone, about five hours’ drive south of Lusaka. It is his 
job to make sure that men who come in for circumcision understand 
the benefits and the risks, not only for themselves but also for their 
partners. On a Saturday morning, Kabanda sits down and arranges his 
materials, among them three large wooden phalluses — one equipped 
with a retractable fabric foreskin — and some condoms. He pulls out a 
small flip chart that details different matters relating to circumcision — 
starting with how the foreskin can increase the transmission of HIV and 
leading on to the benefits and limitations of circumcision. 

Those who support the campaigns, including the WHO, PEPFAR and 
the authors of the clinical trials, stress the importance of making sure 
that patients understand that circumcision provides limited protection 
and that men should continue to use condoms. But critics ask whether 
the message is getting through. There is certainly room for confusion. 
Although Kabanda’s presentation is thorough, one page of the flip chart, 
titled “MC does not offer 100% protection”, shows a picture of a man
chasing two women down the street while throwing his condoms away. 


14 NOVEMBER 2013 | VOL 503 | NATURE | 183 


© 2013 Macmillan Publishers Limited. All rights reserved 




MAKING THE CUT 


Since health trials in Africa demonstrated that circumcision lowers rates of HIV transmission, donor nations and charities have poured more than US$100 million into 
circumcision campaigns in 14 African nations. But they have a long way to go to reach the goal of 20 million circumcisions by 2015. 
Other benefits are overstated: “You are preventing your partner from
cervical cancer,” Kabanda says, when, in fact, the reduction in risk is far 
from complete. 

Outside the clinic in Lusaka, it is clear Marvin has also received mixed 
messages from his counselling session. “We were told there was a sixty- 
something per cent of being... I don’t know if it’s from HIV or some 
sexually transmitted disease,” he says. “And the 45%, I don't know what
it is; I didn’t get much about that. I think I'll try to ask later.” 

Many men may face similar confusion. As part of the procedure, par- 
ticipants get only one counselling session on the matter of circumcision 
and one with their HIV test. By contrast, the men in the clinical trials had 
sessions before and after circumcision, and every six months during the 
21-month follow-up period. Some public-health advocates argue that the 
protocols used in the trials should be replicated closely in the campaigns, 
otherwise they will not be as effective. “When technologies like male cir- 
cumcision are demonstrated effective, everybody forgets about the rest 
of the protocol and just looks at the medical procedure,” says Seth Kalichman,
a clinical psychologist at the University of Connecticut in Storrs who
studies behavioural responses to HIV-prevention campaigns. “We need to 
think about the standard of care within which a procedure is implemented 
if we're going to hope to get the effects that we saw in the trial.” 

However, in a campaign that aims to reach millions of men, it would 
be hard to get participants to return for regular counselling. And some 
researchers, such as Auvert, dismiss the importance of counselling, argu- 
ing that it has little impact on behaviour. “If counselling was enough to 
change sexual behaviour to reduce the risk of acquiring HIV, we would 
have stopped the HIV epidemic in Africa a long time ago,” he says.


RISK ASSESSMENT 

The question of behaviour has become a crucial one in the debate over 
circumcision. A chief concern is that men and women who believe 
that they are less likely to acquire HIV will engage in more risky sexual 
behaviour. This means that circumcision “could have the opposite effect 
in the long run”, Garenne says. “People might be more likely to get HIV
than if they were not circumcised at all.”

Studies of this phenomenon — known as risk compensation — are 
just starting to shed light on how circumcision alters behaviour, and so 
far the findings are mixed. In one survey⁷ conducted in Kenya directly
before the circumcision scale-up took place, Bailey and his colleagues 
reported that one-fifth of men and women they interviewed said that 
condom use is less necessary and that HIV is a less serious threat because 
male circumcision is available. 

These misconceptions could put women at particular risk. Erika 
Layer, a health-communications researcher at the Johns Hopkins University
in Baltimore, Maryland, explored that possibility by interviewing
33 women following a circumcision campaign in the city of Iringa in 
Tanzania. She found an overwhelming preference for circumcised men'; 
women believed them to be cleaner and better educated. Some women 
also assumed — incorrectly — that the reduced risk of contracting HIV 
applied to women, too. As one woman who was interviewed for the 
study put it: “Circumcised men who are HIV-positive can have sex with 
women and the women will not get the infection easily.” 

Much of the research on risk compensation relies on self-reported 
behaviours and often involves qualitative studies. But Layer’s interviews 
showed consistent misunderstandings about circumcision, a finding 
borne out by studies in other countries that suggest women are less 
likely to require circumcised men to use a condom. “There is evidence 
in South Africa, Kenya and Tanzania that shows that generally women 
are overstating the benefits of circumcision and believe the risk to be 
lower than it actually is,” she says.

Other studies, however, suggest that circumcision does not lead to more 
risky behaviour by men and women. Earlier this year, Auvert published 
findings from a survey in the same region as his original trial in 2005. He
found that despite a high uptake of circumcision, there was no reported 
difference in sexual behaviour — for instance in condom use or number 
of partners — between circumcised and uncircumcised men, and the esti- 
mated HIV incidence rate was around 60% lower among the circumcised 
group. Similarly, when Bailey and his colleagues did a follow-up study in 
Kenya, they found no increase in risk associated with the circumcision. 

Researchers are at odds over the implications. “Any public-health 
measure will have the potential for risk compensation, but there is no 
evidence at all that risk compensation is taking place and indeed there 
is some evidence that the opposite is occurring,” Bailey says. 

Kalichman disagrees. He believes that men are getting mixed mes- 
sages, and that it doesn’t make sense that they would agree to circumci- 
sion if they were going to consistently use condoms, which offer a much 
higher level of protection. “Why bother with the procedure if you have 
to continue to use condoms?” he asks. “The whole point is, ‘if I have the
procedure I don't need to’.”

The take-home message on risk compensation, he says, is that “we 
don't know the extent of it and we also don't know the impact of it”. It is
possible, he adds, that the overall effect is minor because the 60% risk- 
reduction offered by circumcision is already so substantial. 

Nonetheless, Layer says it is important to make sure women are also 
targeted by clear messages about the limitations of circumcision. “If we 
do a little bit more to try to engage women I think we could reduce the
immediate and the long-term risk for both women and men,” she says.

Billboard adverts and family-planning programmes are helping to spread the message about the benefits of male circumcision.

Circumcision advocates say that in the long run, a general reduction 
in HIV prevalence in the population will ultimately benefit women, 
even if some women are placed at greater risk in the short term. “I am
absolutely convinced that some women will become HIV positive
because of the circumcision of their partners,” says Auvert. However, he
adds that he will publish some preliminary results soon showing that the 
impact on women is extremely small. “We are not concerned,” he says. 

The issue of balancing individual risks against widespread benefits 
also comes up in studies looking at how many men have sex during the 
healing period after circumcision, when open wounds and unhealed 
stitches increase the risk of transmission. In the counselling session, 
men are told to abstain from sex for six weeks 
or, if they can't wait, to use a condom. However,
a study by Paul Hewett of the Population
Council in Lusaka and his colleagues found that 
around one-quarter of men said they resumed 
sex prematurely; of those, 82% engaged in at 
least one act of unprotected sex, and a number 
had unprotected intercourse with two or more 
partners. When the team modelled the effect 
of this on HIV transmission, it estimated that if 
61,000 men were circumcised in one year, early 
resumption of sex would result in a total of 69 
extra HIV infections. But the overall net effect was protective, with 230 
fewer HIV infections in one year, predominantly among men. 
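
As a rough check on the figures quoted above, the arithmetic can be sketched in a few lines of Python. The variable names are illustrative, and the sketch assumes that the quoted 230 is a net figure that already subtracts the 69 extra infections; the study's own transmission model is not reproduced here.

```python
# Figures quoted in the text for the modelled one-year scenario.
men_circumcised = 61_000
extra_infections_early_sex = 69   # attributed to early resumption of sex
net_infections_averted = 230      # assumed to be net of the 69 above

# Under that assumption, the gross number averted before the offset is:
gross_infections_averted = net_infections_averted + extra_infections_early_sex


def per_thousand(count, population):
    """Express a count as a rate per 1,000 people."""
    return 1000 * count / population


net_rate = per_thousand(net_infections_averted, men_circumcised)
share_lost = extra_infections_early_sex / gross_infections_averted

print(f"Gross infections averted: {gross_infections_averted}")
print(f"Net infections averted per 1,000 men: {net_rate:.2f}")
print(f"Share of gross benefit offset by early sex: {share_lost:.1%}")
```

Read this way, early resumption of sex would offset a little under a quarter of the modelled benefit without reversing it.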


TARGET DRIVEN 
Both critics and advocates of the circumcision campaigns agree that it 
will be a struggle to reach the 20-million mark by 2015, given that so far
countries have only reached 15% of that overall goal. At health clinics 
across Zambia, much of the conversation is about targets, says Layer. 
“This seems to be the trend — do whatever it takes to get men in and 
out that door,” she says. So circumcision providers are coming up with
innovative ways to recruit more men. The US embassy in Zambia has 
launched a new effort to work with chiefs of traditionally non-circumcising
tribes in the hope that they will “convince their own people that even
though this isn’t a part of their historic tradition this is the right choice”,
says Mark Storella, the US ambassador to Zambia. And mobile clinics 
travel to hard-to-reach areas to recruit men. “There’s a lot of pressure 
right now in this country because of PEPFAR and others,” says Thuma. 
The recruitment drive is happening across the region. In Kenya, some 
service providers are even offering ‘moonlight circumcisions’ at night,
or financial incentives for referrals. More money is also being invested,
but this has put significant pressure on local service providers to operate 
on more men. The Bill & Melinda Gates Foundation, based in Seattle, 
Washington, has implemented a pay-for-performance model in Zambia, 
for instance, in which organizations are reimbursed on the basis of their 
ability to reach a target number of circumcisions. “I think you have to be 
concerned about the impact on quality of such an incentives approach,” 
says one person familiar with the programme who wished to remain 
anonymous. He wonders whether it could cause circumcision providers 
to cut costs for other essential services, such as counselling. In a statement
to Nature, Sema Sgaier, a senior programme officer with the Gates
Foundation, wrote “to ensure that VMMC delivery does not affect estab- 
lished standards of care, it is always delivered as 
one component of a comprehensive package of 
services”. These include HIV counselling and 
testing as well as screening and treatment for 
other sexually transmitted diseases. 

Everybody from donors to doctors in the 
clinic agrees that the circumcision effort must 
be part of a broader strategy to strangle Africa’s
HIV epidemic. And although the 20-million 
goal remains distant, the campaigns could still 
bring major benefits, says Bailey. “Even if you 
don’t reach the 80% target that’s not to say you're
not going to have a large impact on the number of new infections,” he 
says. “Every circumcision still has a preventive effect.” 

For people living in the epicentre of the HIV epidemic, that protective 
effect — however limited — is an attractive offering. Marvin, for one, 
is confident in his decision. “I think, for me,” he says, “it’s going to be a
good thing.”
SEE EDITORIAL P.165


Catherine de Lange is an editor with Nature in London. She travelled 
to Zambia on a fellowship from the International Reporting Project at 
Johns Hopkins University in Baltimore, Maryland. 
A race against RESISTANCE


Several African nations could strike a major 
blow against malaria by sacrificing the efficacy 
of some older drugs. Can they make it work? 


BY AMY MAXMEN 


It is September in southeastern Mali, and
Louka Coulibaly is standing in the shade of
a squat, concrete building, giving instructions
to a dozen men and women perched
on a wobbly wooden bench. Coulibaly, a local 
medical supervisor, hands out nylon back- 
packs, each filled with bags of pills, plastic 
cups and a porcelain mortar and pestle that 
the women pause to admire. By noon, the 
men and women are packing up and head- 
ing back to their respective villages on foot, 
bicycle and motorcycle. 

The following day, they and about 
1,400 other health workers throughout the 
region will set up shop in public spaces: under 
the shade of mango trees, in one-room schools, 
at market stands and in district health centres. 
They will mix and mash the pills with the mor- 
tar and pestle, dissolve them in water in a cup, 
and hand the bitter dandelion-coloured liquid 
to about 164,000 children. 

The effort is part of a broad campaign to 
prevent malaria by providing African children 
with drugs usually used to treat the disease. 
Nearly 1.2 million healthy children from 
parts of Mali, Togo, Chad, Niger, Nigeria and 
Senegal received these drugs during the rainy 
season — from around July to November — 
when malaria usually ravages the population. 

The countries’ governments are deploying 
this intervention — known as seasonal malaria 
chemoprevention, or SMC — with financial 
support from the United States, the United 
Nations and the medical aid organization 
Médecins Sans Frontières (MSF), also called
Doctors Without Borders. Next year, many 
plan to expand the campaigns, and other countries
hope to launch their own, encouraged by
recommendations from the World Health
Organization (WHO).

Preventive use of antimalarial drugs is not
new: tourists routinely swallow them when 
travelling. But public-health officials have 
long instructed people living in regions where 
the disease is endemic to refrain from taking 
drugs prophylactically, in part because of con- 
cerns that the parasite that causes malaria will 
develop resistance when many people take the 
medicine on a long-term basis. 

That risk has not disappeared. In fact, 
scientists fully expect SMC to encourage 
widespread drug resistance. No one knows 
when, exactly, but it could happen within 
as few as five years. Until then, SMC has the 
power to prevent 8.8 million cases and 80,000 
deaths each year if implemented in regions 
with high rates of seasonal malaria. That is 
considered a powerful enough benefit to jus- 
tify losing the drugs. 

“Life is a risk,” says Coulibaly, a Malian hired 
by MSF to train local health workers. “And if 
you don't take risks, you don’t win.” 

The project is designed to forestall drug 
resistance as long as possible, and to work in 
concert with mosquito nets and other preven- 
tive methods. Supporters hope that the com- 
bination will significantly suppress malaria, 
so that even if resistance eventually spreads, 
the caseload should be smaller and manage- 
able with other treatments. But SMC will not 
be as successful if funding and infrastructure 
falter — and so far, pro- 
“SMC is feasible,” says Estrella Lasry, technical
adviser for malaria chemoprevention at MSF.
“But it requires organization, a series of steps
and money to back it.”


WHEN INTENTIONS BACKFIRE 

Previous attempts at large-scale malaria 
chemoprevention offer lessons on what not 
to do. In the 1950s, David Clyde, a malaria 
researcher with the British Colonial Medical 
Service, administered the drug pyrimethamine 
to villagers in Tanzania. At the time, pyrimeth- 
amine had a strong track record of clearing 
the parasite. But with any drug, there is a slim 
chance that some strains of parasite will be 
resistant and will survive to infect others — a 
chance that increases when many people take 
the medicine in an area where the parasites are 
abundant and circulate year-round. 

Clyde’s experiment drove this concept 
home: malaria rates dropped at first, but after 
five months, 37% of infections in the village 
no longer responded to the drug¹. Eight years
later, pyrimethamine resistance had spread: up 
to 40% of infections within 25 kilometres of the 
original intervention site were unresponsive.

In the Debela village in southeastern Mali, a mother gives her baby drugs meant to stave off malaria.


The 1960s brought more lessons — this 
time, when scientists tried adding the drug 
chloroquine to table salt. Clinical trials had 
shown² that the salt drastically lowered
malaria rates. But when the tactic was scaled 
up and the salt was distributed to markets 
in Guyana and Brazil, people consumed 
only what met their tastes. Others opted for 
untreated salt when they could, because the 
chloroquine made their skin itch. As a result, 
many people carried sub-therapeutic levels of 
the drug — not enough to reduce the malaria 
burden, but enough to promote resistance. 
“The salt campaigns were a disaster,” says 
Christopher Plowe, a malariologist at the 
University of Maryland School of Medicine 
in Baltimore. 

Governments and aid organizations mostly 
shelved chemoprevention programmes after 
that, but resistance continued to grow — 
albeit slowly — as people used drugs to treat 
malaria infections. Between 1960 and 2000, 
chloroquine resistance crept around the globe 
and the malaria death toll steadily rose. That 
trend started to reverse around 2005, after the 
widespread adoption of the drug artemisinin,
derived from Chinese sweet wormwood
(Artemisia annua). Today, artemisinin-based 
drugs are the gold standard for treating malaria. 


SECOND CHANCE 

Alassane Dicko, a malariologist at the University 
of Bamako in Mali, was a graduate student in 
Plowe’s laboratory in 2001, when he started to
think seriously about reviving chemoprevention.
As a child, Dicko had lost his older brother
and his best friend to malaria. Later, as a medical
student working in hospitals, he was distraught
at the number of children he saw dying. “You
really feel it,” he says. “If we want to do anything
for this country in terms of health, we need to
stop malaria first.”

Dicko suggested that older antimalarials 
might be repurposed for prevention in places 
where resistance to them is not yet widespread. 
By using drugs seasonally, only in uninfected 
children and in combination rather than alone, 
he hoped to avoid some of the mistakes of the 
past. With drug combinations, parasites need 
to acquire several mutations to survive. These 
mutations usually come at a cost to the parasite,
so removing the selective pressure of the
drugs during the dry season would give parasites
still sensitive to the treatment a chance to
outcompete resistant ones. 

Dicko proposed using a mixture of sulpha- 
doxine and pyrimethamine called SP, which was 
known to be relatively safe over the long term. 
In 2002, his team treated 130 children with SP 
for two months in a placebo-controlled trial in 
Mali³. The treatment reduced malaria by 68%.

Other West African scientists followed the 
study. Among them was Badara Cissé, a Sen- 
egalese researcher then pursuing his doctor- 
ate with malariologist Brian Greenwood at 
the London School of Hygiene and Tropical 
Medicine. Greenwood had been considering 
chemoprevention since the 1980s, and he and 
Cissé immediately grasped the potential in 
Dicko’s approach. In 2004, they began a trial in 
Senegal to test three monthly doses of SP plus 
artesunate, an artemisinin derivative. Com- 
pared with the placebo group, nearly nine out 
of ten malaria cases were averted⁴.

With a US$4.5-million grant from the Bill & 
Melinda Gates Foundation in 2008, Cissé and 
his colleagues launched an as-yet-unpublished, 
3-year clinical trial to study SP with another 
drug, amodiaquine (to preserve the efficacy of 
artemisinin). They treated nearly 200,000 chil- 
dren under 10 years old and found that they had 
83% fewer cases of malaria than controls, says 
Cissé. Smaller trials in other African nations 
reported similar findings. 

These are impressive numbers, especially 
given how recalcitrant malaria has been to pre- 
ventive measures. No vaccine has ever proved 
fully effective against the disease, for example. 
And the one that is closest to approval — RTS,S 
— has shown disappointing results in ongoing 
clinical trials, with less than a 50% reduction in 
cases (see Nature 502, 271-272; 2013). 


RESISTING THE CRITICS 

SMC raised some concerns that slowed its 
adoption. Some health officials suggested 
that natural, partial immunity to the parasite 
— built up as a child survives multiple bouts 
of malaria — would be compromised. Others 
fretted about the potential side effects of taking 
the drugs regularly. But the loudest complaints 
were about losing the drugs to resistance. 

In a cramped office in a makeshift building
at the University of Dakar, Cissé explains how 
he was frustrated by the deliberations among 
public-health officials as malaria waged war on 
Senegal’s children. He slumps in a chair that 
seems much too small for him and asks, “Isn’t 
it selfish to sit in our offices with air condition- 
ing, saying that we should save these drugs?” 
He recalls a single night, 20 years ago, when 
he watched five children die of malaria. There 
was nothing he could do to save them. “If this 
happened to you, you would not be debating 
about the fear of losing a drug,” he says. 

In 2012, SMC finally won over most offi- 
cials. The Cochrane Collaboration — an 
international group based in Melbourne, 
Australia, that specializes in evidence assessment
— analysed results from trials in Senegal,
Mali, Burkina Faso, Ghana and Gambia, and
concluded⁵ that SMC could prevent more than
three-quarters of malaria cases in places where 
the disease struck seasonally. In the trials, the 
signs of side effects, resistance and reduced 
immunity were all minimal. According to 
another report⁶, nearly 21 million children
in these regions stood to gain from SMC each 
year. And prevention is cheaper than treat- 
ment. Each month, chemoprevention costs 
$1.50 per child, which pales in comparison to 
the costs of travel and medical care for a child 
who falls ill. In November 2012, the WHO 
published SMC-implementation guidelines 
that enabled countries to apply for funds from 
international organizations⁷.


SLOW START 

Implementation has been a challenge, however. 
Mamadou Lamine Diouf, the drug-procure- 
ment manager for Senegal’s National Malaria 
Control Program, says that the rollout there 
was supposed to reach nearly 600,000 children 
each month, starting in July and August. But 
he and the US agency footing the bill for the 
medicine had underestimated how much time 
it would take to get these older drugs manufac- 
tured anew and assessed by various organiza- 
tions. By early November, health workers had 
managed to reach only 53,000 children. “We 
are learning by doing,” says Diouf. “Now we 
know that if we don’t master this long supply 
chain, nothing will be possible.” 

Drug delays set back chemoprevention 
pilots in northern Nigeria by a month. Togo’s 
campaign did not start until September. 
Burkina Faso’s project failed to launch when 
funds came up short. And the size of Mali’s 
intended intervention dropped after a coup 
d’état and an invasion by al-Qaeda affiliates
last year sent the nation into disarray. 


A MILLION OUNCES OF PREVENTION 


By November 2013, seasonal malaria chemoprevention (SMC) reached almost 1.2 million children in areas that 
receive at least 60% of their annual rainfall in the rainy season. If SMC were scaled up to cover all areas where it 
might be effective, it could reach 25 million children and prevent an estimated 80,000 deaths each year. 
Still, with the lessons learned, supporters say
that they will be better prepared next year (see ‘A
million ounces of prevention’). In March, some
countries plan to apply for funding from the
Global Fund to Fight AIDS, Tuberculosis and
Malaria. Scott Filler, a disease coordinator at the
Global Fund, which is based in Geneva, Switzerland,
says, “There are not many things that
can prevent malaria in 75% of children, so we
will fully support it when countries come to us.”

As the programmes continue, researchers
will keep watch to see if resistance to the drugs
mounts. Randomly selected people who come
to hospitals to be treated for malaria in Mali,
Chad and Niger will have a spot of their blood
smeared on filter paper, placed in a ziplock bag
and shipped to a laboratory in Bamako, where
Dicko and his colleagues will look for mutations
associated with resistance to SP and amodiaquine.
The University of Dakar will conduct
similar tests.

For the campaigns to have a long-lasting
effect, chemoprevention must work faster
than the parasites acquire resistance. Supporters
hope that the treatments will destroy most
malaria parasites over the next several years,
driving down infection rates and keeping them
down even when resistance begins to spread.

Ramanan Laxminarayan, director of the
Center for Disease Dynamics, Economics and
Policy, a health-policy think tank in Washington DC,
is sceptical. He predicts that imperfect
implementation will prevent campaigns from
having the benefits seen in clinical trials, and
that the disease will bounce back in the end.

Importantly, says Paul Milligan, a malaria
researcher at the London School of Hygiene
and Tropical Medicine, funding agencies must
support follow-up evaluations to catch unintended
effects such as increased vulnerability
to malaria in children who outgrow the interventions.
“If we just roll this out without surveillance,
we risk repeating all of the mistakes
made in the past,” he warns.

Yet surveillance and drug resistance mean 
little to the mothers who congregate in a small 
village in the Koutiala region of Mali just after
sunrise in September. Awa Damale, 25 years
old and clad in an embroidered aqua dress and
matching headscarf, arrives by donkey cart with 
her four children and two from another family. 
Five of the children swallow their medicine, but 
one of Damale’s sons has felt ill this week. He 
tests positive for malaria and gets a referral to 
the nearest clinic. SMC is for prevention only. 

The boy’s illness may be a sign that the drugs
he took last month are not 100% effective — or 
that he did not swallow all of the medicine — 
but his condition does not dampen Damale’s 
enthusiasm. It is the first time this year that 
one of her children has had malaria. Before the 
intervention, she constantly juggled working on 
the farm with caring for sick children. She does 
not want to hear about the possibility of the pro- 
gramme drying up or the drugs losing potency 
years down the road. Most of her children are 
healthy now, and that is what matters most.
SEE EDITORIAL P.165 


Amy Maxmen is a freelance science journalist 
in New York City. Travel for this story was 
paid for by a grant from the Pulitzer Center on 
Crisis Reporting in Washington DC. 


1. Plowe, C. V. Trans. R. Soc. Trop. Med. Hyg. 103, S11-S14 (2009).
2. Giglioli, G., Rutten, F. J. & Ramjattan, S. Bull. World Health Org. 36, 283-301 (1967).
3. Dicko, A. et al. Malar. J. 7, 123 (2008).
4. Cissé, B. et al. Lancet 367, 659-667 (2006).
5. Meremikwu, M. M., Donegan, S., Sinclair, D., Esu, E. & Oringanje, C. Cochrane Database Systematic Rev. 2012 http://dx.doi.org/10.1002/14651858.CD003756.pub4 (2012).
6. Cairns, M. et al. Nature Commun. 3, 881 (2012).
7. World Health Organization Seasonal malaria chemoprevention with sulfadoxine-pyrimethamine plus amodiaquine in children: A field guide (2012).

SOURCE: MSF/MALARIA CONSORTIUM/M. CAIRNS, LSHTM/UNICEF/PMI/D. KONE, MALI NATL PROG. FOR THE FIGHT AGAINST MALARIA


ANDREW BIRAJ/REUTERS/CORBIS 


In young children, the parasitic infection cryptosporidiosis is one of four leading causes of severe diarrhoea. 


Time to tackle 
cryptosporidiosis 


The little-studied parasite Cryptosporidium is a major threat to infants. 
Boris Striepen calls on microbiologists and funders to give it more attention. 


Earlier this year, a massive clinical and
epidemiological study involving
22,500 children from Africa and Asia
revealed — unexpectedly — that the proto- 
zoan parasite Cryptosporidium is one of four 
pathogens responsible for the lion’s share of 
severe diarrhoea in infants and toddlers. 
According to the World Health Organiza- 
tion, diarrhoea accounts for 10.5% of the 
nearly 8 million yearly deaths of children 
under five years of age. (For comparison,
malaria causes 7% of such deaths, and HIV/ 
AIDS, just 2%.) 


Vaccines and treatments are already 
available or fast being developed for three 
of the four pathogens identified: rotavirus, 
Shigella bacteria and enterotoxigenic 
Escherichia coli (see ‘Child killer’). But 
for ‘crypto’, there is no fully effective drug
treatment or vaccine, and the basic research 
tools and infrastructure needed to discover, 
evaluate and develop such interventions are 
mostly lacking. 

The technical challenges of working on 
crypto in the laboratory have led to the per- 
ception that the pathogen is an intriguing yet 


intractable problem. Crypto lab cultures last 
a few days at most, for instance, and some 
of the species that infect humans cannot 
be easily studied in standard model organ- 
isms such as mice. As a result, funders and 
biologists have tended to shy away from the 
parasite: a search on the biomedical research 
database PubMed suggests that in the past 
five years, about 20 times more articles have 
been published on malaria than on crypto. 
I believe, however, that with the right
tools, research approach and financial
backing, the prevention of deaths caused
by crypto infections is a realistic goal.
Furthermore, bringing crypto research into
mainstream microbiology could provide
fundamental mechanistic insights into intestinal
infections in general.

EASY TARGETS
The parasite Cryptosporidium causes severe infections in young children and people with weak immune systems. Crypto has a simple, single-host life cycle. Infection begins with the ingestion of water or food containing spore-like oocysts. Cryptosporidium enters the cells of the small intestine.


LINGERING PROBLEM 

Infection with crypto occurs when people 
ingest spore-like oocysts, usually by drink- 
ing contaminated water (see ‘Easy targets’). 
Parasites emerging from these spores invade 
the epithelial cells lining the small intestine, 
and cause the cells to form internal struc- 
tures that house the parasite. These struc- 
tures supply the parasites with nutrients 
and position them at the interface between 
the digestive tract and its contents. Infec- 
tion generally results in one to two weeks 
of severe diarrhoea, but in people with 
HIV/AIDS or in malnourished children, the 
disease can linger and eventually kill. 

Only one drug treatment for crypto is 
currently approved by the US Food and 
Drug Administration (FDA): nitazoxa- 
nide, marketed under the name Alinia. 
Clinical studies in Egypt have shown that
otherwise healthy children and adults suffering
from diarrhoea caused by crypto
are twice as likely to recover within a week
when given the drug. But those people who
are most vulnerable to the life-threatening
disease need something better. In a study
in Zambia, diarrhoea symptoms lifted for
fewer than 25% of malnourished children
after treatment with nitazoxanide. Meanwhile,
the drug seemed to have no effect in
children or adults whose immune systems
are suppressed, for instance because of HIV
infection.

CHILD KILLER
Cryptosporidiosis is the second biggest cause of diarrhoeal disease and death in infants. Chart legend (partial): enterotoxigenic Escherichia coli; Shigella spp.

Through horizontal gene transfer, crypto 
parasites have obtained various genes from 
the bacteria sharing their intestinal environ- 
ment, some of which are known to be associ- 
ated with bacterial disease. Crypto may also 
use products of bacterial metabolism to help 
it to grow and cause illness. By targeting the 
products of some of these genes, research- 
ers could home in on treatments that dis- 
rupt pathways in the parasite but that do not 
affect human cells.

Furthermore, crypto has a tiny genome, 
a simple life cycle requiring only one host 
and its reproduction is relatively straight- 
forward. So it should be possible to cross 
crypto strains. Crossing strains that differ 
in the types of host they infect, for instance 
— such as healthy mice versus mice with a 
faulty immune system — could shed light on 
the genes and mechanisms causing disease. 

Sequencing the genomes of two crypto 
species that infect humans has revealed
vulnerabilities stemming from the parasite’s 
minimalist metabolism. Mammalian cells 
have multiple ways of producing certain 
metabolites; crypto often relies on just one, 
using components borrowed from its human 
host or bacteria. Drugs designed to target 
such pathways — interfering with enzymes 
that supply the building blocks of DNA and 
RNA, for example — show some promise. 

Another option might be to repurpose 
drugs used to treat other illnesses. A statin
used to cut down cholesterol, for example, 
can interfere with the ability of crypto to 
get crucial lipids from its host cells. And 
efforts to find treatments could piggyback 
on existing large-scale programmes to iden- 
tify malaria drugs — crypto is related to the 
Plasmodium parasites that cause malaria. 

The main challenge to finding drug treat- 
ments for crypto, however, is the lack of tools 
for validating and testing potential targets 
or promising classes of compounds. Cur- 
rently, assays are neither sufficiently robust 
nor scalable to exploit the high-throughput 
techniques typically used to screen hundreds 
of thousands of drug candidates.

Figure labels from ‘Easy targets’: The oocysts pass into the colon and are released in the faeces. Released crypto infects others.

Parasites that have been genetically 
engineered to synthesize proteins that 
produce light or colour, such as the green 
fluorescent protein from jellyfish, enable 
researchers to detect robust correlates of 
infection in a reproducible and cost-effective 
way for numerous infectious diseases, such 
as malaria and tuberculosis. Such technolo- 
gies would be transformative for the study 
of crypto. 


STEMMING INFECTION 

Even better than finding drugs to treat 
crypto effectively would be preventing the 
disease from taking hold in the first place. 

Although crypto takes a massive toll on 
infants and toddlers, it is rarely detected in 
those older than two years, suggesting that 
children may develop natural immunity. This 
idea is supported by animal studies: in one, 
for instance, vaccines containing crypto pro- 
tected calves and cattle; in another, piglets
previously infected with one crypto species 
later showed complete resistance to that same 
species, and showed mild symptoms on sub- 
sequent infection with a different species”. 

Small studies involving about 20 adult vol- 
unteers suggest that although infection can- 
not prevent subsequent disease, it can reduce 
its severity. So developing vaccinations
containing parasites that have been rendered 
incapable of causing disease, through irra- 
diation or genetic engineering, is likely to be 
the most hopeful approach. Such vaccines 
have shown promise in preventing other 
parasitic infections such as malaria and
leishmaniasis. 

Technology transfer from established to 
emerging research models can greatly aid 
innovation. For instance, applying knowledge 
and techniques associated with genetic modi- 
fications in fruitflies has revealed how mos- 
quitoes smell and identify humans, offering 
avenues for the development of repellents and 
baits. For crypto, concepts and tools devel- 
oped for the related parasites Toxoplasma and 
Plasmodium, including how to render them 
nonpathogenic, could prove useful. 

Animal models that better mimic human 
infection and disease than current ones 
would allow researchers to study the impact
of intestinal commensal bacteria on crypto
infection. Calves and piglets, for instance, 
are natural hosts of the species of crypto that 
infects humans. 

More money is key. The US National 
Institutes of Health (NIH) currently 
spends roughly US$4.3 million each year 
on 12 crypto projects, compared to around 
$300 million on more than 600 malaria 
projects. 

Review panels for government agencies 
typically take a dim view of studies on non- 
model organisms that have a higher risk 
of failing than those on well-established 
systems, or that require large upfront 
investments in technology. Meanwhile, phil- 
anthropic organizations such as the Bill & 
Melinda Gates Foundation, based in Seattle, 
Washington, tend to focus on intervention 
rather than basic research. And industry 
may not see a market in a disease that mainly
affects the poor. 

Yet both the NIH and the Gates Founda- 
tion have already made substantial invest- 
ments in projects designed to identify the 
causes of early childhood diarrhoea. Now 
these organizations have a responsibility to 
act on their discoveries. Furthermore, crypto 
kills young animals too, so the possibility of 
preventing it in livestock should stimulate 
industry interest, resources and expertise. 

As well as potentially saving hundreds 
of thousands of young lives and reducing 
untold suffering for infants, children and 
parents, I am confident that bringing crypto 
to the forefront of microbiology research will 
have wide-ranging benefits. It is likely to be 
an excellent model system for understanding 
the complex interaction between pathogens, 
host nutrition, inflammation and the intestinal
microbiome.


Ecology must 
evolve 


Tackling global problems requires a fresh approach, 
argues Georgina Mace, as the British Ecological 
Society celebrates its centenary. 


Climate change, the threat of
pandemics, population growth,
food security and the loss of biodiversity
and ecosystem services demand
a new kind of ecology: one that focuses
on how whole communities of organisms, 
at the scale of landscapes or catchments, 
interact with people and the physical 
environment. 

The advances in ecology in the past 
century have hugely improved our under- 
standing of species interactions, such 
as those between hosts and parasites or 
between predators and prey, as well as 
population dynamics, food-web dynam- 
ics and how organisms adapt to their local 
environments. Such gains have come 
mostly from a combination of theory and 
modelling, and carefully designed long-term
laboratory or field experiments in
places as diverse as the Serengeti Wild- 
life Research Centre in Tanzania and the 
University of Oxford’s Wytham Woods 
site, UK. Indeed, historically, careers in 
ecology have tended to revolve around the
‘ownership’ and analysis of a personal study
system or a painstakingly curated data set. 

In part because of this history, there 
are few general theories for how multiple 
species respond to perturbations, such 
as disease or shifting weather patterns, 
at the community level. This is a major
problem for what is becoming known as 
global-change science. 

The British Ecological Society, founded 
in 1913 “to promote and foster the study
of Ecology in its widest sense”, has been
key in disseminating knowledge among 
ecologists over the past few decades, by 
publishing leading journals and organ- 
izing academic meetings, and through 
its increasing presence in education and 
policy. As its outgoing president, how- 
ever, I worry that in today’s world, there 
are more pressing demands. And, as for 
other scientific societies that are reliant on 
revenue from academic journals, the shift 
to electronic and open-access publishing 
could weaken its business model. 

A new role for the BES and similar
societies could be to help drive the
kind of ecological understanding that 
would guide and inform efforts to 
address today’s greatest global chal- 
lenges. 


EXPANDING HORIZONS 

No matter how intriguing the details and 
well understood the dynamics and pro- 
cesses, the results of small-scale ecological 
studies are difficult to tie into models simulating
Earth systems such as the climate,
ocean circulation or the water cycle. Yet as
long as researchers model such systems
without taking ecological processes into
account, they risk missing significant feedbacks,
thresholds and constraints.

Insights from long-term studies, for 
instance of red deer on the Scottish isle 
of Rum, or of various species inhabiting 
Yellowstone National Park in Wyoming, 
must be tied into global-change science. In 
fact, a new kind of ecology is needed that is
predicated on scaling up efforts, data sharing
and collaboration. This would link up
with related disciplines in the environmen- 
tal, social and physical sciences, and focus 
much more on meta-analyses and synthe- 
sis. We need to identify general trends, such 
as the effect of a change in temperature on 
animal dispersal and the knock-on conse- 
quences for ecosystems, and draw general 
conclusions, for instance on how the practices
of local fishermen affect the health of
fisheries. A few scaled-up ecological models 
do exist, notably the emerging Madingley 
model, which is an attempt to simulate 
global ecosystems. But many more are
needed. 

The European Research Council, UK 
research councils and the US National 
Science Foundation are being increasingly 
sparing with grants to single investigators. 
Instead, the trend is to award larger sums 
(US$2 million to $10 million) to multi- 
skilled teams of researchers tackling broad 
environmental questions. Presumably, it is 
easier and cheaper to give large sums to a few 
people than smaller sums to many different 
recipients. But this preferential funding of 
large-scale programmes can penalize peo- 
ple who are still in the early stages of their 
careers, because researchers are likely to be 
invited into such teams only after they have 
established themselves. It also threatens the 
funding of long-term ecological studies, 
some of which are already proving invalu- 
able to better understanding the ecological 
consequences of human-induced altera- 
tions, such as climate change. 

Societies such as the BES, the Ecological 
Society of America, the Society for Conser- 
vation Biology and related scientific socie- 
ties for meteorology, geology and geography 
could help to forge a new ecology that would 
be both inclusive and innovative. Goals
would be shaped by scientists,
policy-makers and users of the 
resulting science, rather than by 
recent publishing trends — which 
is often what drives allocations 
from funders at present. 


HARNESS COLLABORATION 

Like most scientific societies, 
the BES relies mainly on revenue from 
owning and publishing academic journals, 
such as the Journal of Ecology and the Jour- 
nal of Animal Ecology. These days, how- 
ever, open-access journals, and publishers 
offering libraries and institutions bulk deals 
for access to many journals, have made 
individual subscriptions less necessary. 
In many societies, membership — which 
for young researchers is now more about 
access to conferences, researchers with 
shared interests, and links to industry and 
policy than to journals — has been falling 
in recent years. 

I believe that the mass of expertise and 
breadth of focus from basic science to policy 
found among members of the BES and other 
societies, as well as among their contacts and 
collaborators, could be used to help identify 
the grand challenges for ecology and envi- 
ronmental-change science, and to design 
and develop research priorities. 

To harness this knowledge, society staff 
need to bring the right people together, 
for instance, through focused meetings, 
and by funding placements for scientists 
in government and in industry. Research 
councils and industry may be willing to 
fund such activities on the grounds that 
the resulting collaborations would help to 
address their own problems and questions. 
Schemes run by the BES, such as those plac- 
ing PhD students in the UK Department for 
Environment, Food and Rural Affairs, are 
hugely beneficial to everyone involved. Such 
efforts need scaling up. 

Furthermore, huge gaps in the science 
and supporting technologies need to be 
closed in many areas, from remote sensing 
and ecological modelling to ‘environmental
genomics’, the study of genome-level
interactions with the environment. Citizen-science
innovations such as iSpot, an online
forum for sharing skills and knowledge 
for species identification, illustrate the 
potential both to improve people’s 
knowledge of plants and ani- 
mals, and to monitor changes in 
species’ phenology, abundance 
and distribution at low cost. 
Scientific societies could help 
to drive the expansion of such 
data-gathering technologies in 
which the interests of users, producers 
and researchers align. For instance, 
iSpot, currently a UK-based initiative, 
would have enormous value globally. 
Societies could also facilitate the bring- 
ing together of related, but operation- 
ally disparate, projects to achieve some 
common goal. 

Often missing from large-scale
studies of the environment and our
interactions with it is the crucial first
step of framing, testing and reframing
the conceptual basis for the work. From
my experience as a coordinating lead author 
in both the Millennium Ecosystem Assess- 
ment (MEA) and the UK National Ecosys- 
tem Assessment, this is key to achieving 
interdisciplinary understanding of terms 
and concepts and to articulating the most 
important questions. For example, in the 
case of the MEA, identifying outcomes 
related to human well-being — such as 
food security or freedom from health prob- 
lems — and the main drivers of ecosys- 
tem change related to ecosystem services, 
allowed experts from diverse disciplines to 
undertake the large, complex assessment of 
the world’s ecosystems. 

By mediating this kind of concep- 
tual framing for global-scale problems, 
numerous scientific societies would have 
a way to support their disciplines and keep 
them relevant to society — now and in the 
future.

A poster from the 1920s reflects the era’s fashion for drug-taking.


HUMAN BEHAVIOUR 


Bending minds, 
building medicine 


Andrew Robinson assesses two studies on the culture, 
biology and chemistry of psychotropic substances. 


From drinking espresso to shooting
heroin, the human impulse towards
altered mental states is universal. Now
two books explore the history of individual
and social attitudes to psychotropic drugs,
and the science of pharmacology.

In Demons, science historian Virginia
Berridge focuses on drug cultures, primarily
in Britain and to a lesser extent in the United
States, Europe and Asia. She chronicles the
consumption of opium, alcohol, tobacco,
cocaine and cannabis, from the nineteenth
century onwards, and examines attempts at
official control. Berridge points out that a 
drug's acceptability to mainstream society 
fluctuates more owing to social and cultural 
trends than to medical knowledge. The 
periodic reclassification of drugs is “not a
‘rational’ process”, she concludes.

Demons: Our Changing Attitudes to Alcohol,
Tobacco, and Drugs
VIRGINIA BERRIDGE
Oxford University Press: 2013.

Drugged: The Science and Culture Behind
Psychotropic Drugs
RICHARD J. MILLER
Oxford University Press: 2013.

Drugged, by pharmacologist Richard
Miller, offers the biochemical research
absent from Demons. We visit university and
industrial laboratories, learn about medical
trials and clinical applications, and explore
the structures of organic molecules and
their functions. Despite copious diagrams, 
much of the technical detail will be impen- 
etrable to the generalist, unlike the accessi- 
ble Demons. However, Miller casts his net 
wider than does Berridge, covering every- 
thing from tea and coffee to LSD (lysergic 
acid diethylamide) and Prozac (fluoxetine 
hydrochloride), and not limiting himself 
geographically. He also makes lengthy and 
lively excursions into culture. 

Opium gets the most attention in both 
books. The latex from the opium poppy 
Papaver somniferum (from which opium 
and opiates such as morphine come) has 
been cultivated and used medically and rec- 
reationally for millennia, and began to seep 
into the West in the sixteenth century. As 
both Berridge and Miller discuss, the opium 
wars of the mid-nineteenth century, which 
pitted China against the French and British, 
played a key part in international regulation 
of drugs in the twentieth century. Opium 
was ubiquitous in Victorian Britain, almost 
like aspirin today, notes Berridge. Prized as a 
painkiller and stimulant, it was prescribed for 
everything from heart disease to gallstones, 
and taken as pills, lozenges, powder and 
even enemas. In the marshy Fens of eastern 
England, opium-spiked beer soothed fevers, 
chills and rheumatism. Prime Minister 
William Gladstone is said to have taken 
opium in coffee before big speeches. 

By mid-century, the tincture laudanum 
was the most widely used form of opium. 
Medicinal use often led to addiction, as 
with early adopters such as Thomas De 
Quincey, author of the autobiographical 
Confessions of an English Opium-Eater 
(1821). Wilkie Collins, whose detective 
novel The Moonstone (1868) featured opium 
addiction, became so dependent on laudanum
that he eventually took it in quantities
large enough to kill an unhabituated
person, and suffered paranoid
delusions. Even Arthur Conan Doyle’s fictional
detective Sherlock Holmes, created in the 1880s,
injected himself with morphine and cocaine
for “mental exaltation”, although in the final
stories Holmes has been weaned off the habit,
presumably in deference to growing social
disapproval.

Laudanum (opium tincture) was a Victorian panacea.

WEIMAR ARCHIVE/MARY EVANS PICTURE LIBRARY

SCIENCE MUSEUM, LONDON/WELLCOME IMAGES

NATURE.COM
W.F. Bynum reviews an exhibition on mind-altering drugs: go.nature.com/uciqym

Opium, alcohol
and cocaine were also
active ingredients in many US patent medi- 
cines in the nineteenth century. By 1900, the 
modern concept of addiction was begin- 
ning to emerge and the tide began to turn 
against cocaine and opium. In 1913, the last 
officially traded chest of opium was shipped 
from British India to China. In 1914, the US 
government passed the Harrison Narcotics 
Tax Act to limit the use of all opiates and 
coca products to medicine. In the 1920s, the 
influential International Opium Convention 
heralded the start of global drug control.

Cannabis (hemp) features more prominently
in Drugged than in Demons. Although
the drug was available in the United States 
long before, in the 1930s Mexican migrant 
labourers introduced North America to the 
practice of marijuana smoking. Marijuana 
was adopted first by black jazz musicians, 
and then, from the 1950s, by white society. Its 
demonization by the US government began 
with the federal Marihuana Tax Act of 1937, 
which made it prohibitively expensive for any 
purpose, including medicinal use. As Miller 
powerfully describes, the legislation was the 
result of racist scare tactics against Mexicans 
and black people by Harry Anslinger, the 
commissioner of the Federal Bureau of Nar- 
cotics from 1930 to 1962. He was supported 
by leaders of the synthetics industry, who 
wanted to end nascent US hemp production.

As for alcohol, the United States’ nationwide
prohibition from 1920 was repealed
in 1933, during the Great Depression. The 
federal government needed the tax revenue. 
Moreover, attitudes had changed: prohibi- 
tion had not solved industrial issues such as 
absenteeism, yet it had encouraged organized 
crime, political corruption and a trade in poisonous
illicit alcohol. However, as Berridge
stresses, prohibition also significantly reduced 
US drinking and alcohol-related disease, and 
might have worked had it been less restric- 
tive. It would have failed in Britain, she argues, 
because of the complex British attitude to 
drunkenness. Norman Kerr, a physician and 
founder of the Society for the Study and Cure 
of Inebriety in 1884, asked whether inebria- 
tion was “a sin, a crime, a vice or a disease”. 
This is a question that many are still attempting
to answer in binge-drinking Britain.

However the use and abuse of specific
drugs may shift over time and space, the 
need to bend the mind seems tightly tied 
to human culture. Berridge and Miller ably 
unpick the threads of that universal urge.


Andrew Robinson is the editor of The 
Scientists: An Epic of Discovery. 
e-mail: andrew.robinson33@virgin.net 


Books in brief 


Aid on the Edge of Chaos: Rethinking International Cooperation in a Complex World
Ben Ramalingam OXFORD UNIVERSITY PRESS (2013)

Many see international development aid as in thrall to linear,
mechanized thinking, and champion approaches in which local
people solve their own challenges with intelligently tailored backing.
Complex adaptive systems thinking, argues Ben Ramalingam, offers
a scientific model for that path. Awareness of real-world dynamics
would, he avers, promote fine targeting of aid efforts and foster a
new aid paradigm: an “open innovation network, catalysing and
leveraging change in countries around the world”.


Owning the Earth: The Transforming History of Land Ownership 
Andro Linklater BLOOMSBURY (2013) 

Historian Andro Linklater hinges his layered chronicle of the past five 
centuries on land ownership, a growing concern on a crowded Earth. 
Arguing that land claims work only when yoked to public interest,
he shows they were also rare up to 1800: before then, much of
Europe was jointly ‘owned’ by peasants and royalty, for instance. 
The tide had begun to turn centuries before, however, when English 
freeholders gained unprecedented power. A brilliant treatise on the 
social and political shifts in our journey from indigenous ‘commons’ 
to the subprime-mortgage catastrophe of 2007 and beyond. 


Pure and Modern Milk: An Environmental History since 1900 
Kendra Smith-Howard OXFORD UNIVERSITY PRESS (2013) 

From cream and cheese to milk bottled, dried and lurking in 
everything from cake to glue, ‘dairy’ is ubiquitous. Yet getting the 
highly perishable, machine-pumped product of lactating cows to 
consumers has been a hugely complex technological, cultural and 
political saga. Kendra Smith-Howard deftly traces that trajectory 
in the United States since 1900. Initially rife with bacteria such
as Salmonella typhi, milk became today’s ‘natural’ staple through
scientific manipulations — in bovine breeding, bacteriology, 
processing technology, and refrigerated bulk storage and transport. 


In Meat We Trust: An Unexpected History of Carnivore America 
Maureen Ogle HOUGHTON MIFFLIN HARCOURT (2013) 

In 2011, Maureen Ogle informs us, the United States produced
42 billion kilograms of pork, poultry and beef. The downsides
are considerable: US livestock production and the meat industry
have drawn fire for everything from pollution to overuse of
pharmaceuticals. Ogle delivers a well-researched history of the
US meat habit in the past 200 years. Although she decries twenty- 
first-century feedlots and “manure lagoons”, Ogle argues that the 
culprits are not solely industrial agriculture and government laxity: 
US consumers’ sense of entitlement to cheap meat is the real issue. 


The Ecological Hoofprint: The Global Burden of Industrial Livestock
Tony Weis ZED BOOKS (2013)

This cogent analysis of rising global meat consumption puts
the blame firmly on industrial livestock production — a meat 
‘monoculture’ that has quadrupled in the past 50 years. Agrarian 
political economist Tony Weis calculates that the cost to the 
environment and animal welfare is unsustainable, and the effect on 
social equity negative: livestock consume a third of all grain, while 
one person in seven remains malnourished or hungry. Barbara Kiser

In Stranger Visions, a work in Grow Your Own, Heather Dewey-Hagborg creates portraits from found DNA.


SYNTHETIC BIOLOGY 


The second creation 


Anthony King finds himself on some of the wilder 
shores of synthetic biology at Dublin’s Science Gallery. 


Peering into a fridge, I am invited to
sniff a cream cheese labelled “Christina”.
It is crafted from milk and starter
cultures sampled from someone’s skin. 
There are three more, each with a unique 
set of microbes from a different person: 
cheesy doppelgängers that will never, in our
pasteurized world, reach the table.

Selfmade is the work of synthetic biologist
Christina Agapakis and smell provocateur 
Sissel Tolaas, who are making the human 
microbiome tangible and highlighting a 
potential conflict between our antibacterial 
present and a future filled with engineered 
bacteria. Agapakis and Tolaas are members 
of Synthetic Aesthetics, a project run by the 
University of Edinburgh, UK, and Stanford 
University in California, to research the bor- 
ders between art and synthetic biology. Theirs 
is one exhibit in the Dublin Science Gallery's 
Grow Your Own... Life After Nature show,
which explores genetic engineering.

Grow Your Own... Life After Nature
Science Gallery, Trinity College, Dublin.
Until 19 January 2014.

The show, which is split into sections
labelled ‘Life’, ‘Society’ and ‘Machines’, offers a brew of utopian
and dystopian views. There are also chances
for visitors to ‘grow their own’ life form in a
biolab and, as usual in this venue, there is
an emphasis on citizen engagement with
science.

In the Life section, I ponder Escherichia
coli that has been modified to smell of
banana, and then an exhibit exploring the
idea that a woman could carry the fetus of a
rare dolphin. In Society, upstairs, I discover
how to ‘biohack’ the first commercially available
genetically modified flower, the blue
Moondust carnation, by using basic plant-culture
methods to re-root the cut flowers
and plant them to rejoin the carnation gene
pool. This do-it-yourself project raises questions
about the ownership of life, including
the moral question of whether the hacked 
plants should be released. 

Artist David Benqué gives the heated 
debate on genetically modified organ- 
isms a novel twist. In his installation The 
New Weathermen, fictional environmen- 
tal activists use madcap synthetic-biology 
schemes to fight corporate biotechnology 
interests. The exhibit takes the form of their 
scaled-down rigs, such as a “self-inflating 
antipathogenic membrane pump” and an 
“autonomous seed disperser”, with which
Benqué’s anarchists spray golf-course grass
modified for herbicide tolerance to spread 
their own open-source super-grass. 

In the Machines section sits a piece by 
design futurist Natsai Audrey Chieza. 
Chieza collaborates with microbiologist 
John Ward at University College London 
to find soil bacteria that produce pigments 
suitable for dyeing textiles. In Faber Futures 
—a reference to the Latin for ‘to create’ — 
Petri dishes are filled with Streptomyces 
bacteria from soils in which different herbs 
were growing. Chieza removed specific 
colonies displaying various colours to dye 
silk scarves: bacteria isolated from around 
tarragon yielded blood-red hues; spear- 
mint, deep purple; and oregano, light blue 
to navy. “Sometimes the most beautiful 
colours emerge if a contaminant comes in,” 
says Chieza. 

Show curator and exhibitor Alexandra 
Daisy Ginsberg’s Designing for the Sixth 
Extinction in the Society section is a natu- 
ralistic ‘photo’ of a verdant forest inhabited 
by strange, digitally inserted creatures — 
such as a roving seed-dispersal device to 
increase plant biodiversity. Patented life 
forms released into the wild to assist endan- 
gered species, they are constructed using 
an expanded DNA code of six bases rather 
than the usual four. This produces non-bio- 
degradable proteins, which are more robust. 
“It sets up a future where we could preserve
and maintain nature using synthetic biology,”
says Ginsberg.

Most people know little about synthetic 
biology, despite the best intentions of lead- 
ers in this field such as Craig Venter and 
George Church. “The foundations are being 
laid now,” Ginsberg says. “That is why it’s so
important that people understand what it is
and the many different directions it could
go in.” Grow Your Own urges us to imagine
many possible engineered futures — positive 
and negative, created with or without our 
consent — and challenges us to engage with 
the research by joining in. In January, visitors 
will even be able to use DNA to nudge E. coli 
into fluorescing in the exhibition biolab.


Anthony King is a writer based in Dublin. 
e-mail: anthonyjking@gmail.com

ORNITHOLOGY


World wide wings 


Stuart Pimm joyously clicks his way through an 
ornithological classic as it goes online. 


many species of bird I 

have seen (3,763) and that 
just over 200 people report having seen 
more. The eccentricity of ornithologists 
may be exceptional, but an interest in birds 
is not. Bird watchers are abundant (the 
Royal Society for the Protection of Birds is 
Britain’s biggest nature-conservation char- 
ity) and ubiquitous (there are bird lists 
for the North and South Poles, and surely 
everywhere in between). So the combined 
knowledge of birdwatchers is prodigious. In 
the Handbook of the Birds of the World Alive 
(www.hbw.com), edited by a team led by 
Josep del Hoyo, that knowledge now has an 
online home, with a basic annual subscrip- 
tion available for €29.95 (US$41). 

The online handbook defines what we 
know about the biogeography, behaviour 
and ecology of the almost 10,000 extant 
avian species at a time of unprecedented 
environmental change. It allows the compi- 
lation of instant, custom-made field guides. 
Its facility for species comparisons will 
accelerate comparative studies, and embed- 
ded user-supplied data will open up new 
research opportunities. 

The first volume of Handbook of the 
Birds of the World was published in 1992 by 
Lynx Edicions of Barcelona, Spain. Some 
15 million words, 1,000 colour plates and 
100,000 references later, the seventeenth and
final volume arrived this year. These gorgeous
books carry the limitations of print, of
course: for instance, they demand the
tedious perusal of indexes to find information,
and provide no easy connection to the
materials quoted.

Handbook of the Birds of the World Alive
EDITED BY JOSEP DEL HOYO ET AL.
Lynx: 2013.

[Figure labels: 4. Goldie’s Bird-of-paradise (Paradisaea decora) ♂; 5. Red Bird-of-paradise (Paradisaea rubra) ♂]

The online version makes obvious ques- 
tions easy. So, for ‘what?’ I click on a thumb- 
nail of gulls and get a plate, scaled by size, of 
the 51 species. For ‘where?’ I filter the search 
by Chile and Argentina and find 14 species I 
might see on an upcoming trip. I could have 
added terns, other bird families or other 
countries. The ‘compare’ feature stacks sim- 
ilar species, such as Belcher’s and Olrog’s 
gulls, side by side. The taxonomy section 
tells me that taxonomists once considered 
these two birds to be the same species. 
Maps show that the former lives on Pacific 
coasts and the latter on Atlantic ones; text 
compares descriptions, habitats and natural 
history. 

The International Union for Conserva- 
tion of Nature’s Red List gives Olrog’s gull 
‘vulnerable’ status. Using an embedded link, 
I access the curated Internet Bird Collection:
9,312 bird species feature in a collection 
that includes almost 100,000 photos and 
80,000 videos. Ten of the items for Olrog’s 
gull are connected to a Google Earth map. 
I shall soon sail past one of these locations. 
The prospect of a sighting is unreasonably 
exciting. 

For many species, the best way to find 
and identify birds is through their call, so 
I click the link for Xeno-canto, the online 
database of bird calls and songs. It provides 
both a recording and a sonogram. Another
link takes me, via Cornell University’s Lab of
Ornithology, to eBird, a massive, global,
crowd-sourced compilation of observations.
A link to BirdLife International tells
me why Olrog’s gull is threatened. Active
links to original sources are included, too.
I can add my comments online and point
to egregious failures to quote every last
one of my published bird papers, however
obscure. And Google’s translation technology
means that the whole resource can be
read in 71 languages.

[Figure label: 6. Emperor Bird-of-paradise (Paradisaea guilielmi) ♂]
The online Handbook of the Birds of the World Alive allows a comparison of similar species.

A roseate spoonbill (Platalea ajaja) soars
through the sky in Florida, USA.

The march of online and crowd-sourced 
data is changing research completely and 
goes far beyond birds. For example, through 
smartphone apps such as iNaturalist, ama- 
teurs and professionals alike are sharing 
information on the species they are pas- 
sionate about — be they frogs, seahorses or 
wild pigs — from Aiguabarreig Segre-Cinca 
in Catalonia, Spain, to Xishuangbanna in 
Yunnan, China. Through these, we are rap- 
idly learning precisely where species are and 
so can quickly pinpoint where habitat loss 
and climate disruption may move them — 
and where extinctions happen. 

The handbook’s users can check off the 
species they have seen in their lifetime. 
Sightings are reported for 14 of the 20 spe- 
cies that have recently become extinct or 
extinct in the wild, meaning that extinc- 
tions are current, not historical, events. 
The handbook lists 176 species as ‘criti- 
cally endangered’. When, and how often, 
birders last counted these provides the 
grist for the statistical mill that predicts 
how likely it is that a species still survives, 
and so can be saved (see S. Pimm Nature 
426, 235-236; 2003). Out of eccentric- 
ity comes insight and, perhaps, even 
conservation action.


Stuart Pimm is professor of conservation 
at the Nicholas School of the Environment, 
Duke University, Durham, North Carolina, 
USA, and author of The World According 
to Pimm: A Scientist Audits the Earth. 
e-mail: stuartpimm@me.com 


14 NOVEMBER 2013 | VOL 503 | NATURE | 197 


© 2013 Macmillan Publishers Limited. All rights reserved 


VISUALS UNLIMITED/NATURE PICTURE LIBRARY 


HANDBOOK OF THE BIRDS OF THE WORLD ALIVE 


Correspondence 


Impact: akin to 
quantifying dreams 


In our haste to measure 
everything in order to wring out 
evidence that non-specialists
can understand and to secure
funding, we forget that 
predicting the impact of research 
is akin to quantifying dreams 
(Nature 502, 271; 2013). There 
are no short cuts for proper 
research assessment. 

The impact of research on 
society is a composite of many 
strands of work, usually by 
different scientists and engineers, 
which — often serendipitously — 
culminate years later in changing 
some aspect of our lives. But 
attempting to disentangle those 
threads is a hopeless task. 

There is probably little 
prognostic value in counting 
research-paper downloads, for 
example. In fact, such metrics are 
but surrogates of real research 
impact and can generate goals 
of their own. They encourage 
‘gaming’, or manipulation of data
to artificially improve metrics. 

When used over time within 
institutions, metrics can be 
useful guides — we all need 
external measures of some sort. 
It is when they are used as a form 
of currency in their own right 
that we get into trouble. 

Jim Woodgett Lunenfeld- 
Tanenbaum Research Institute, 
Toronto, Canada. 
woodgett@lunenfeld.ca 


Impact: take peer 
review into account 


Recognition of a researcher’s
impact should include a measure 
of his or her contribution as a 
peer reviewer in maintaining high 
scientific standards in research 
papers and grant applications 
(Nature 502, 287; 2013). 

It should be feasible to create 
a reviewer’s equivalent of the
h-index — which measures 
the impact of research output 
in terms of its quantity and 
popularity. This might indicate, 
for example, the number and 


impact of journals and papers for 
which the reviewer has acted as a 
referee. Publishers releasing these
figures would need to protect
the blind or double-blind review
process.
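
As a rough illustration of the kind of metric under discussion, the standard h-index (the largest h such that h items each have at least h citations) can be computed in a few lines. The sketch below is a minimal Python version. Applying it to a referee's record, as in the hypothetical `review_counts` list, is only an assumption about how a reviewer's equivalent might be built, not a scheme set out in this letter.

```python
def h_index(counts):
    """Return the largest h such that h items each have a count of at least h."""
    ranked = sorted(counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Hypothetical data: citations received by the papers one referee has reviewed.
review_counts = [25, 8, 5, 3, 3, 1, 0]
print(h_index(review_counts))
```

Any real reviewer metric would also have to decide what is counted (journals, papers, grant applications) and how to keep the counts from revealing who reviewed what.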

They could do this by using 
a central repository to assign 
these metrics to particular 
researchers through identifiers 
such as the Open Researcher 
and Contributor ID scheme (see 
Nature 485, 564; 2012). 

As well as securing the 
reputation of reviewers, such a 
measure would benefit journals 
and publishers by encouraging 
more scientists to undertake 
refereeing. 

E. Tobias Krause Bielefeld 
University, Germany. 
tobias.krause@uni-bielefeld.de 


Impact: China needs 
to review its metrics 


Impact metrics are creating their 
own problems in China (see 
Nature 502, 271; 2013). 

By recognizing only the first 
and corresponding authors 
on a paper, for example, 
China's metrics are indirectly 
discouraging participation in 
research consortia. The practice 
is counterproductive in this era 
of collaborative science. 

Furthermore, review 
articles do not count towards 
impact metrics in China, so 
they do not help to advance 
an author's academic career. 
Information overload means 
that multidisciplinary, scholarly 
and timely reviews are more 
in demand than ever. Review 
authors need comprehensive 
knowledge, expert insight and 
outstanding inductive and 
deductive abilities. We therefore 
believe that this apparent 
discrimination is unjustified. 
Xiangyu Ma College of 
Preventive Medicine, Third 
Military Medical University, 
Chongqing, China. 
Zhiyuan Song Southwest 
Hospital, Third Military Medical 
University, Chongqing, China. 
xymacq@hotmail.com 
US shutdown should
spur other nations 


What the recent US government 
shutdown has highlighted is just 
how much it contributes to the 
world’s research infrastructure — 
with PubMed a striking example. 
By comparison, governments
of other high-GDP countries
contribute shamefully little.

The world urgently needs 
freely accessible, searchable, 
comprehensively referenced 
and cross-linked international 
repositories for the burgeoning 
amounts of untapped ‘omics’ 
data (lipidomics being one 
example). It is time for other 
wealthy countries to create and 
fund more such platforms. 

Julio Licinio, Ma-Li Wong 
South Australian Health and 
Medical Research Institute, and 
Flinders University, Adelaide, 
Australia. 

julio.licinio@sahmri.com


Riding shutdowns in 
developing world 


The recent US government 
shutdown hit researchers in the 
Antarctic particularly hard (G. E. 
Hofmann Nature 502, 431-432; 
2013). But spare a thought for 
the feats of endurance shown by 
scientists from the developing 
world, who continue working in 
such harsh environments while 
contending with chronically 
unpredictable political factors in 
their own countries. 

These scientists regularly 
battle long-term, government- 
induced setbacks to their 
research programmes. 
Regardless of their scientific 
importance, these studies 
do not receive widespread 
publicity when things go 
wrong — because such events 
are so frequent. But adverse 
environmental conditions offer 
only a limited logistical window 
for researchers, and natural 
systems will not wait for the 
resolution of political brawls. 

Examples of such successful 
long-term research programmes 


from developing-world scientists 
include Indian and Brazilian 
initiatives in the Antarctic
(S. D. Gad Curr. Sci. 95, 151;
2008, and P. Artaxo et al. Tellus 
44B, 318-334; 1992), and an 
ongoing subantarctic marine- 
mammal study by South 
Africans that has so far lasted for 
40 years (M. N. Bester et al. Afr. J. 
Mar. Sci. 33, 511-521; 2011). 

P. J. Nico de Bruyn University of 
Pretoria, South Africa. 
pjndebruyn@zoology.up.ac.za 


Science is not about 
simple stories 


Presenting science as a compelling 
story is becoming a popular way 
of communicating results — a 
technique that is guaranteed 
to capture the attention of the 
scientific community and the 
public. Although science needs 
great stories, stories are not science. 
Storytelling glosses over 
uncertainties; methodological 
detail and even results can get 
lost in the narrative’s overarching 
trajectory. By bounding over 
hurdles, it keeps the listener 
rooting for an amazing ending. 
But we should not forget 
that the temptation to package 
results into a satisfying story has 
frequently led to the retraction of 
scientific papers. 
Jeroen Bergmann The Brain 
Sciences Foundation, Rhode 
Island, and Massachusetts 
Institute of Technology, USA. 
jeroen.bergmann@media.mit.edu 


CORRECTION 

The Correspondence ‘Discard 
Soviet doctorate systems’
(A. Gorobets Nature 503, 39;
2013) wrongly implied that 
researchers in Russia, Ukraine, 
Belarus and other former Soviet 
states do not rely on peer- 
reviewed publications for career 
advancement. In fact, they do 
not rely on international peer- 
reviewed publications. Also, 
doctorate-degree candidates 
are not classified as ‘students’. 


NEWS & VIEWS 


APPLIED PHYSICS 


A cascade laser’s random walk 


A random array of holes etched in a semiconductor structure, consisting of a periodic series of thin layers, has been
demonstrated that emits mid-infrared laser radiation. The device could have sensing and imaging applications. 


HUI CAO & STAFFORD W. SHEEHAN 


Fifty-three years after their initial
demonstration by Theodore Maiman¹,
lasers are now ubiquitous in our daily lives. We use them at the
checkout counters of supermarkets, to 
read or write to DVDs, or just to surf the 
Internet (through fibre-optic communi- 
cation systems). Lasers are also widely 
used in industrial settings and hospitals. 
Over the years, scientists and engineers 
have developed many types of laser using 
various materials for diverse applications. 
But despite such tremendous advances in 
laser technology, it is still a pleasant sur- 
prise to learn of developments that may 
pave the way for further cutting-edge 
applications. One example of this is a 
special class of laser — a random laser — 
operating in the mid-infrared that Liang 
et al. report² in Advanced Materials.

A laser has two basic components: a 
gain medium, which is a material that 
emits and amplifies light; and an optical 
cavity, a structure that traps light tem- 
porarily to make amplification efficient. 
The simplest laser optical cavity is made 
up of two mirrors facing each other, 
one highly reflective and the other par- 
tially transmitting, with a gain medium 
between them. Light is reflected back 
and forth between the mirrors, and each 
time it passes through the gain medium 
it is amplified, until it leaves through the 
partially transmitting mirror. A lasing 
threshold, beyond which laser light is emit- 
ted, is reached when the amplification of light 
caused by the gain medium compensates for 
the loss of light from the mirrors. 
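
The threshold condition for the simple two-mirror cavity can be written out explicitly. The following is a sketch under stated assumptions, with symbols introduced here for illustration: R_1 and R_2 are the mirror reflectivities, L is the length of the gain medium and g is the gain per unit length. Absorption and scattering inside the cavity are neglected.

```latex
% Idealized two-mirror cavity: after one round trip the light has crossed
% the gain medium twice and been reflected once by each mirror.
% Threshold is reached when the round-trip amplification equals 1.
\[
  R_1 R_2 \, e^{2 g_{\mathrm{th}} L} = 1
  \quad\Longrightarrow\quad
  g_{\mathrm{th}} = \frac{1}{2L} \ln\!\left(\frac{1}{R_1 R_2}\right)
\]
```

For instance, with assumed values R_1 = 1, R_2 = 0.9 and L = 0.1 m, the threshold gain comes out at roughly 0.53 per metre; the leakier the mirrors, the more gain is needed.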

Unlike a conventional laser, a random laser 
uses a highly disordered structure to trap the 
light**. Although they might seem incom- 
patible with lasers, disordered materials are 
common in our daily lives: milk, toothpaste, 
paper and clouds are just a few examples. 
They all consist of small particles or droplets 
that scatter light. When a light wave hits a 
particle, its direction of travel is changed 
randomly, and that particle is called a scatter- 
ing centre. After propagating farther, the 
light wave will hit another scattering centre 


[Figure 1 labels: mid-infrared random lasing emission; voltage; semiconductor membrane; substrate; quantum-cascade active region]

Figure 1 | An electrically pumped mid-infrared random 
laser. Liang et al.² fabricated a random array of holes open to
the air in a semiconductor membrane on a substrate. When 
an external electrical voltage is applied between two gold 
electrodes, electrons are injected from the top electrode to 
the quantum-cascade active region in the membrane, which 
subsequently emits and amplifies light. That light is scattered 
by the air holes, bouncing from one hole to another before 
escaping vertically or laterally. The multiple scattering and 
interference of light traps it temporarily inside the active 
region, enhancing light amplification. When light leakage and 
absorption are balanced by this amplification, lasing occurs. 


and change direction again. In a system that 
consists of thousands of scattering centres, a 
light wave will be scattered many times and 
undergo a ‘random walk’. Multiple scattering
in this manner increases the path length
of the light, in turn increasing its dwell time
in a disordered structure. If a gain medium
consisting of dye molecules is incorporated 
between scattering centres, light amplifica- 
tion is enhanced by multiple scattering, and 
lasing can occur. 
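
The way multiple scattering lengthens the light path can be illustrated with a toy simulation. The sketch below is not the model used in the work discussed here. It assumes a two-dimensional walker that takes unit-length steps in uniformly random directions (the step standing in for the distance between scattering events) and counts the steps needed to leave a disc of a given radius. The function names and parameters are illustrative only.

```python
import math
import random


def steps_to_escape(radius, rng):
    """Count the unit-length steps a 2D random walker takes to leave a disc."""
    x = y = 0.0
    steps = 0
    while x * x + y * y < radius * radius:
        angle = rng.uniform(0.0, 2.0 * math.pi)  # isotropic scattering
        x += math.cos(angle)
        y += math.sin(angle)
        steps += 1
    return steps


def mean_steps(radius, trials=2000, seed=1):
    """Average the escape step count over many independent walkers."""
    rng = random.Random(seed)
    total = sum(steps_to_escape(radius, rng) for _ in range(trials))
    return total / trials


for radius in (5, 10, 20):
    # A straight-line escape would take about `radius` steps.
    print(radius, mean_steps(radius))
```

For this idealized walk the mean number of steps lies between the square of the radius and the square of the radius plus one, up to sampling noise, so doubling the size of the scattering region roughly quadruples the path length. That quadratic growth is why light dwells so much longer in a disordered medium than in a transparent one of the same size.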

Since the pioneering work of Nicolay Basov” 
and Vladilen Letokhov’ in the 1960s, extensive 
studies** have shown that random lasing is a 
phenomenon that applies to numerous sys- 
tems, including powders, colloids, polymers, 
fibres and even biological tissues. Lasing
frequencies range from the near-ultra- 
violet to the near-infrared. A major 
obstacle to the practical application of 
random lasers is the expensive source 
that is required to pump energy into the 
gain medium so that it can amplify the 
light. There are two pumping schemes: 
one is optical, involving absorption of 
light from another source (usually a laser 
at a different frequency); the other is elec- 
trical, such as injection of current from a 
battery. The second scheme is much 
cheaper and more convenient for many 
applications. However, most random 
lasers are optically pumped. 

Liang and colleagues’ work represents 
a major advance in random-laser tech- 
nology. The authors pushed the lasing 
frequency to the mid-infrared, and 
achieved electrical pumping. To do this, 
they etched a random array of air holes 
into a semiconductor membrane (Fig. 1).
Light propagating in the membrane 
is scattered from one hole to another, 
randomly changing direction at each one 
— rather like a drunk person wandering 
in a forest and changing direction whenever
he encounters a tree, to use a common
random-walk analogy. Because this
structure is fabricated by lithography and 
chemical etching, the diameter, position 
and density of the holes can be accurately 
determined, allowing precise control of 
the scattering strength. When the scatter- 
ing is sufficiently strong, random lasing 
occurs. The lasing threshold decreases with 
increasing air-hole density, because more holes 
help to trap light more effectively. 

To push their lasing wavelength out to
10 micrometres, Liang et al. used a quantum 
cascade laser, which is an intrinsically elec- 
trically pumped device that provides strong 
amplification of light in the mid-infrared’. The 
membrane comprises a periodic series of thin 
layers of compound semiconductors. When an 
external voltage is applied, a representation of 
the energy levels of electrons in these layers 
would look like a staircase. Once an electron 
is at the top of the staircase, it begins to cas- 
cade down and emits a photon with every step 
that it descends. The quantum cascade laser’s 


emission frequency is determined by the 
energy drop at each step, which can be adjusted 
by changing the layer thickness. 
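
The link between the energy drop per step and the emitted light can be made concrete with the photon relation E = hc/λ. The sketch below assumes the idealized picture given above, in which each step releases one photon carrying the full energy drop, and uses the 10-micrometre wavelength quoted in the text. The function names are illustrative.

```python
PLANCK_J_S = 6.62607015e-34        # Planck constant, in joule seconds
LIGHT_SPEED_M_S = 299_792_458.0    # speed of light in vacuum, in metres per second
ELECTRON_VOLT_J = 1.602176634e-19  # one electronvolt, in joules


def photon_energy_ev(wavelength_m):
    """Photon energy in electronvolts for a given vacuum wavelength in metres."""
    return PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m / ELECTRON_VOLT_J


def photon_frequency_thz(wavelength_m):
    """Photon frequency in terahertz for a given vacuum wavelength in metres."""
    return LIGHT_SPEED_M_S / wavelength_m / 1e12


wavelength = 10e-6  # 10 micrometres
print(f"{photon_energy_ev(wavelength):.3f} eV")
print(f"{photon_frequency_thz(wavelength):.1f} THz")
```

On these assumptions each step of the staircase corresponds to an energy drop of about 0.124 eV and a frequency of about 30 THz; a smaller drop per step would give a longer wavelength.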

Liang and co-workers’ quantum cascade 
random laser raises hopes of new imaging 
applications. Many biochemical molecules 
have spectral ‘fingerprints’ in the mid-infrared 
range, a property that can be used for sensing 
and imaging applications — for example, in 
trace-gas sensing for pollution control and 
environmental monitoring; combustion 
diagnostics; and medical diagnostics (such 
as breath analysis). A crucial characteristic 
of random lasers is that the spatial coherence 
of their laser emission can be easily tuned’. 
Spatial coherence describes the correlation 
between waves at different points in space: 
sunlight shining through a cloud has low 
spatial coherence, whereas stars in the night 
sky have high spatial coherence. Low spatial 
coherence is desirable for parallel imaging, 
because light illuminating different spatial 
positions is uncorrelated, and so coherence 
artefacts such as crosstalk and speckle (random 
granular patterns) are avoided’. 

To produce emission that has low spatial 
coherence, a random laser must have many 
modes (oscillating waves) lasing simulta- 
neously*. Liang and colleagues’ laser sup- 
ports only a limited number of random-lasing 
modes, so its spatial coherence could still be 
high, although this has not been measured. 
A further increase in the laser’s scattering 
strength would probably cause more modes 
to lase, thereby reducing the spatial coher- 
ence of the emission. However, because 
random-laser emission is non-directional, 
in this case a clever scheme would need to be 
developed for the efficient collection of total 
emission. 

Looking to the future, this method of making 
mid-infrared random lasers could be extended 
to terahertz (10¹² Hz) frequencies. In fact,
lasers known as quasi-periodic, distributed- 
feedback quantum cascade lasers operating 
in the terahertz regime were constructed 
only three years ago’’. Common packaging 
materials such as cardboard and plastics are 
transparent to terahertz radiation, making it 
useful for package inspection, quality control 
and non-destructive testing’. A combination of 
low spatial coherence and high radiance would 
also make a terahertz random laser suitable 
for applications such as high-speed parallel 
inspection.
Metal ghosts in the
splicing machine 


Chemical analysis of the spliceosome’s active site reveals that it is the RNA 
components of this enzyme complex that coordinate the catalytic metal ions 
responsible for production of a spliced messenger RNA. SEE ARTICLE P.229 


SCOTT A. STROBEL 


Genes are transcribed as pre-messenger
RNA molecules in which the coding exon
segments are typically interrupted with
non-coding sequences, or introns.
To create a functional mRNA, the introns must 
be removed and the exons joined together in 
a process known as pre-mRNA splicing. The 
two phosphoryl-transfer reactions of splicing 
are catalysed by the spliceosome, a metallo- 
enzyme complex comprising five small 
nuclear RNAs (snRNAs) and dozens of core 
proteins¹. The spliceosome uses magnesium
ions at its active site as catalytic cofactors, 
and researchers have long been captivated by 
the question of whether the ligands for these 
metal ions are provided by the snRNAs or by 
the proteins, as this speaks to the evolution- 
ary roots of the splicing process. On page 229 
of this issue, Fica et al.² report that a specific
constellation of phosphate oxygens in the 
U6 snRNA supplies the ligands that coordi- 
nate the catalytic metal cofactors in the active 
site. This finding establishes the concept 
that pre-mRNA splicing is fundamentally an 
RNA-catalysed reaction. 

Fica and colleagues used an approach 
known as a metal-specificity switch to estab- 
lish how the catalytic metal ions are bound 
within the spliceosome. They systematically 
introduced single atoms of sulphur in place 
of phosphate oxygens in the U6 snRNA of the 
spliceosome. Of the 20 positions tested, they 
found five substitutions that impaired splicing 
in the presence of magnesium ions (Mg²⁺), a
metal that binds efficiently to oxygen but not to 
sulphur. The splicing activity of these impaired 
spliceosomes could be rescued by the addition
of manganese (Mn²⁺) or cadmium (Cd²⁺)

Figure 1 | Active sites of group II introns and the spliceosome. a, Group II introns are RNA elements
that self-catalyse splicing — the removal of introns (red) and the joining of adjacent exon regions (green). 
They contain a hairpin structure called domain V; the phosphoryl-transfer reactions of splicing (the first 
of which is shown with an orange arrow) are catalysed by two divalent metal ions (M1 and M2) that are 
coordinated (dashed purple lines) to phosphate-group oxygen atoms of nucleotides (asterisks) in this 
region’, to a non-bridging oxygen at the splice site and to an OH group from an adenosine nucleotide (A) 
in the intron. b, Fica et al.² now show that RNA also catalyses these reactions during pre-mRNA splicing by
the spliceosome — a metalloenzyme complex composed of small nuclear RNAs (snRNAs) and proteins. 
They show that the U6 snRNA of the spliceosome (which forms a hairpin structure similar to that of 
domain V) provides the phosphate oxygens that bind to two catalytic metal ions at the enzyme’s active site, 
in a similar manner to the metal-ion coordination of group II introns. The adjoining U2 snRNA of the
spliceosome is also shown.

ions, metals that bind more efficiently to
sulphur than does magnesium. Such changes 
in metal specificity constitute a chemical signa- 
ture that has been used to establish metal-ion 
binding to specific oxygens within other RNA 
active sites³.

Efforts to map metal-ion binding sites in the 
U6 snRNA have been made before using this 
approach, albeit on a less comprehensive scale 
(for example, see ref. 4). What sets Fica and 
colleagues’ work apart is the triangulation of 
binding sites in U6 to the reactive groups in 
the pre-mRNA substrate. The authors estab- 
lish that at least four of the five phosphate 
oxygens are coordinated to the catalytic Mg²⁺
ions during the two phosphoryl-transfer reac- 
tions of splicing. They demonstrated this by 
preparing complexes with double and even 
triple sulphur-substituted spliceosomes and 
substrates. As the number of substitutions 
increased, they observed a requirement for 
greater thiophilicity (sulphur-binding preference)
of the rescuing metal: Cd²⁺ provided
rescue but Mn²⁺ did not.

Titration of the Cd²⁺ concentration needed
to achieve rescue suggests that there are two 
different catalytic metal ions within the active 
site (one coordinated to the nucleophilic 
hydroxyl and one coordinated to the leav- 
ing group), and that these metals interact 
with distinct phosphate oxygens in U6. 
Interestingly, no specificity switch was detected 
upon sulphur substitution in the RNaseH 
domain of Prp8, a protein at the core of the 
spliceosome that had been speculated to have a 
catalytic role⁵.

The inorganic architecture of the splice- 
osome active site that emerges from these 
studies is strikingly similar to that seen in the 
self-splicing group II introns⁶ (Fig. 1). This
subclass of introns are able to splice themselves
out of the pre-mRNA without the need
for the spliceosome, yet group II splicing and
spliceosomal splicing involve the same two
phosphoryl-transfer reactions and generate
the same reaction intermediates and products.
The crystal structure of a group II intron⁶
revealed an active site with two catalytic metal 
ions coordinated to phosphate groups in the 
domain V helix of the RNA that are spatially 
and functionally equivalent to the phosphates 
that Fica et al. show are coordinated between 
the U6 snRNA and the pre-mRNA. The par- 
allels between these molecules even extend 
to the stereochemistry of the oxygen atoms 
that serve as ligands for the two metal ions. 
Although it has long been known that the 
hairpin-loop structures of U6 and domain V 
are functionally similar, this work demon- 
strates the chemical equivalence between these 
two systems in their metal-ion coordination. 

There are more than 100 proteins in the 
spliceosome, so what role do they have if 
RNA is the catalyst of pre-mRNA splicing? 
Some proteins are retained throughout the 
splicing process, whereas others associate 


or dissociate at various steps in the reaction. 
Fica and colleagues’ data do not exclude the 
possibility that some of the catalytic metal 
ligands are provided by protein functional 
groups. The authors tested one amino-acid 
candidate in spliceosomal factor Prp8 and 
obtained a negative result, but other seg- 
ments of Prp8, particularly a loop close to the 
active site’, remain a possibility. In addition 
to the two catalytic divalent metals present in 
domain V of group II introns, this structure 
also contains two monovalent potassium (K⁺)
ions, resulting in a cluster of four metal ions’. 
Does the inorganic architecture extend to the 
monovalent ions, or do proteins play the part 
of K⁺ within the spliceosome? Whatever the
answers to these questions, it seems that the 
main role of the proteins is not catalytic, but 
that they are instead involved in regulation, 
in determining splice-site specificity and as a 
scaffold to promote the proper conformation 
of the snRNAs. 

Although the group II intron structure is
invaluable, we still lack high-resolution crys- 
tal structures of the spliceosome in any of its 
various reaction states. Such a structure would 
reveal the full relationship of U6 and its asso- 
ciated proteins to the catalytic metals and 
pre-mRNA substrate. The complexity, hetero- 
geneity and size of the spliceosome have so far 
precluded structural analysis at high enough 
resolution to extract biochemical mechanistic 
information, but cryo-electron microscopy 
images continue to improve*. There is also 
a need to confirm the mechanistic parallels 
between the spliceosome and the group II self- 
splicing system. The biochemical data avail- 
able on U6 have outpaced those for domain V, 
so it has not been tested whether the metals 
observed in domain V show a metal-specificity
signature consistent with a direct role in
catalysis, as has now been demonstrated for
the metal ions in U6. 

In the early days of our understanding of 
self-splicing RNAs, chemist Thomas Cech 
posed the question: is RNA-based catalysis 
a general phenomenon or a rare molecular 
fossil’? For many years, the study of RNA 
catalysis was confined to introns in genomes 
outside the nucleus and in non-model organ- 
isms, including yeast mitochondria and cili- 
ated protozoa. But the crystal structure of the 
ribosome” revealed that RNA provides the 
active site responsible for protein synthesis 
across all forms of life. The functional demon- 
stration by Fica et al. that pre-mRNA splicing 
is catalysed within an RNA metalloenzyme 
active site clearly shows that RNA-based catal- 
ysis is a fundamental and ubiquitous process 
involved at several steps in the conversion of 
genetic information to functional proteins.
SOLAR SYSTEM


Russian skyfall


The recent entry of a 20-metre-wide celestial rock into Earth’s atmosphere 
offered both a spectacular show and a source of invaluable data that advance 
our understanding of high-velocity impacts. SEE LETTERS P.235 & P.238


NATALIA ARTEMIEVA 


Two papers in this issue focus on the
scientific reconstruction of the asteroid-impact
event that was observed
on 15 February 2013 in Chelyabinsk, Rus- 
sia — the largest impact event on Earth since 
the Tunguska blast of 1908. Borovicka et al. 
(page 235) extract substantial scientific 
information from several low-quality video 
recordings. Meanwhile, by analysing infra- 
sound airwaves and the brightness of the light 
flash generated by the impact, as well as the
damage caused to Earth’s surface, Brown et al.²
(page 238) estimate the total energy of the event
to have been 400-600 kilotons of trinitrotoluene
(1 kt of TNT is equivalent to 4.185 × 10¹² joules
of energy), and say that such events occur more 
often than previously thought. 
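
To put the energy estimate in SI units, the conversion is a single multiplication. The sketch below takes the factor as 4.185 × 10^12 joules per kiloton of TNT, the figure quoted above (the conventional definition is 4.184 × 10^12 joules); the function name is illustrative.

```python
JOULES_PER_KT_TNT = 4.185e12  # conversion factor quoted in the text


def kt_to_joules(kilotons):
    """Convert an energy in kilotons of TNT to joules."""
    return kilotons * JOULES_PER_KT_TNT


for kt in (400, 600):
    print(f"{kt} kt TNT = {kt_to_joules(kt):.3e} J")
```

The 400-600 kt range therefore corresponds to roughly 1.7 × 10^15 to 2.5 × 10^15 joules.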

There are three fundamental physical pro- 
cesses involved as meteoroids — asteroids’ 
smaller counterparts — enter the atmos- 
phere. First, atmospheric drag decelerates 
meteoroids from their initial velocity of about 
20 kilometres per second to a free-fall velocity
ranging from several centimetres to hundreds 
of metres per second, depending on the mete- 
oroid’s size. Second, atmospheric shock waves 
heat the air surrounding the meteoroid and 
make the body visible as a ‘bolide’. The waves’
thermal radiation evaporates the meteor- 
oid, which may lose more than 90% of its 
initial mass before it hits the ground. Third, 
extremely high dynamic pressure breaks the 
meteoroid into smaller fragments, which in 
turn are subjected to even higher degrees of 
deceleration and evaporation. 

In his letter to this journal³ in 1908,
W. F. Denning described the night of 30 June
in Bristol: “The firmament was abnormally 
luminous, with a very strong glow all over 
the north at midnight.” Two other letters on 
the same page (and others in Nature’s next 
issue) reported bright nights in Berlin, Lon- 
don and Prague. On the same day, atmos- 
pheric disturbances and seismic signals were 
registered across Eurasia, with the strongest 
reported in Russian Siberia. In 1927, a huge 
area of devastated forest was discovered near 
the Podkamennaya Tunguska River, also in 
Siberia. It took decades to connect all of these 
observations with a cosmic-body impact, now 
known as the Tunguska event, which carried 
a total energy of 10-15 Mt TNT (ref. 4). Taken 
together with the hypothesis⁵ that an asteroid
impact caused the mass extinction at the
Cretaceous–Palaeogene geological boundary
66 million years ago, the Tunguska event highlighted
a new threat to the human race and a
challenge for astronomers: to predict asteroid 
impacts in advance. 

So far, some 90% of near-Earth objects, such
as comets and asteroids, larger than 1 kilometre
in diameter have been monitored.
Smaller, Tunguska-like bodies are much more
difficult to detect. However, the near-Earth
asteroid 2012 DA14, with a diameter of about
30 metres, was discovered in February 2012.


Figure 1 | The chunk of the Chelyabinsk asteroid that was pulled from Lake Chebarkul. 


A year later, in Tucson, Arizona, I was watching
NASA’s news on television to find out how
to observe the asteroid during its expected 
closest approach of some 27,700 km the next 
day. Several minutes later, breaking news 
came from a radio tuned to a Russian station, 
announcing that a superbolide had exploded 
over Chelyabinsk. 

My first thought was that the Russian media 
had misinterpreted NASA’s news. But it was
not a mistake. Thousands of people observed 
a bright flash in the sky followed by a powerful
sonic boom that destroyed windows
across an area of about 5,000 km². More than
1,500 people were injured, mainly by broken
glass. Amateur video recordings of the event
allowed Borovička and colleagues to reconstruct
the body’s trajectory and its fragmentation
history: the entry angle was unusually
low, approximately 17°; the observed trajectory 
in the atmosphere exceeded 250 km in length; 
several light flashes occurred below an altitude 
of 40 km; and hundreds of fragments were 
dispersed along the trajectory. 

Borovička et al. say that the largest fragment
(which they name F1) would have touched 
down in Lake Chebarkul. The recent recovery 
of this fragment (Fig. 1), which weighs some 
600 kg, is the best prize for the scientists. The 
authors also reconstruct the Chelyabinsk aster- 
oid’s space biography and connect its origin 
with the 86039 near-Earth asteroid, rather than 
the long-anticipated 2012 DA14 asteroid, which
passed by Earth almost unnoticed, thanks to its 
loud precursor. Brown and colleagues estimate 
the pre-atmospheric diameter of the asteroid 
to be in the range of 17-20 m, with a total mass 
of about 10⁷ kg. Only a tiny fraction of this
mass, less than 0.01%, has been recovered on 
Earth’s surface. 
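
As a rough consistency check on these figures, the sketch below estimates the mass of a spherical body 17-20 m across and the kinetic energy carried by a body of about ten million kilograms. The bulk density of 3,300 kg per cubic metre is an assumption (a value typical of stony meteorites, not one given in the article), and the entry speed of 20 km/s is the generic initial velocity quoted earlier rather than a measured value for Chelyabinsk.

```python
import math

KT_TNT_JOULES = 4.185e12  # joules per kiloton of TNT, as quoted in the article


def sphere_mass_kg(diameter_m, density_kg_m3):
    """Mass of a solid sphere with the given diameter and bulk density."""
    radius = diameter_m / 2.0
    return density_kg_m3 * (4.0 / 3.0) * math.pi * radius ** 3


def kinetic_energy_kt(mass_kg, speed_m_s):
    """Kinetic energy 0.5 * m * v**2, expressed in kilotons of TNT."""
    return 0.5 * mass_kg * speed_m_s ** 2 / KT_TNT_JOULES


# Assumed bulk density for a stony body (not taken from the article).
ASSUMED_DENSITY_KG_M3 = 3300.0

for diameter in (17.0, 20.0):
    mass = sphere_mass_kg(diameter, ASSUMED_DENSITY_KG_M3)
    print(f"diameter {diameter:.0f} m -> mass {mass:.2e} kg")

# Article figures: mass of about 1e7 kg; generic entry speed of about 20 km/s.
energy_kt = kinetic_energy_kt(1e7, 20e3)
print(f"kinetic energy of roughly {energy_kt:.0f} kt TNT")
```

Under these assumptions the diameter range brackets a mass of about ten million kilograms, and the kinetic energy comes out at a few hundred kilotons of TNT, well below the 10-15 Mt quoted for Tunguska.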

Both teams used standard models of impacts 
to make their calculations. Although these 
models have been validated for numerous 
small meteoroids, they may fail for meteoroids 
that are larger than 10 m in diameter. For such
objects, the collective behaviour of fragments 
plays a crucial part: evaporation could be 
more intense and account for the final mass 
deficiency, whereas dynamic pressure could be 
much lower, explaining why large fragments 
survive. It is to be hoped that the Chelyabinsk 
event will prompt researchers to refine their 
models. 

How common are Tunguska- and Chelya- 
binsk-like events — should we expect other 
guests from outer space any time soon? 
Standard methods predict a Tunguska-like
event once every millennium, and a Chelya- 
binsk-like event once a century. Brown and 
colleagues are more pessimistic, suggesting a 
frequency that is tenfold higher for both types 
of event. We will probably have to wait some 
100 years to find out. 
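
To put the competing frequency estimates side by side, the following sketch converts each average rate into the chance of seeing at least one event in a century. It assumes impacts arrive as a Poisson process (independent events at a constant average rate), which is a modelling assumption made here for illustration and not a statement about how either team derived its numbers.

```python
import math


def prob_at_least_one(events_per_year, years):
    """Probability of one or more events in `years`, assuming a Poisson process."""
    return 1.0 - math.exp(-events_per_year * years)


# Average rates taken from the text: once per millennium, once per century,
# and the tenfold higher frequencies suggested by Brown and colleagues.
rates_per_year = {
    "Tunguska-like, standard estimate": 1 / 1000,
    "Chelyabinsk-like, standard estimate": 1 / 100,
    "Tunguska-like, tenfold higher": 10 / 1000,
    "Chelyabinsk-like, tenfold higher": 10 / 100,
}

for label, rate in rates_per_year.items():
    chance = prob_at_least_one(rate, 100)
    print(f"{label}: {chance:.1%} chance within 100 years")
```

Even at an average rate of one event per century, the chance of at least one event in any given century is about 63% rather than a certainty, which is why a single century of observation cannot settle the question.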

And are astronomers capable of predict- 
ing the impacts of meteoroids a few metres 
in diameter? Probably not: such objects are 
too small to be detected in space by modern 
telescopes. One successful prediction was 
that of asteroid 2008 TC3, which was discovered
by accident a day before it struck Earth’s
atmosphere, and recovered later in Sudan
as the Almahata Sitta meteorite. If the trajectory,
size and composition of a body are
known, physicists can predict its behaviour — 
whether it will be harmlessly vaporized and 
fragmented in the atmosphere or whether 
it will cause serious damage to the planet’s 
surface (such as shock waves, wildfires and 
impact craters). 

Finally, can humans minimize the risk of 
impact-related injuries? Yes they can, by learn- 
ing a few basic facts about meteoroids and 
their interaction with the atmosphere — just 
as people living in seismically active areas are 
prepared for earthquakes. A meteoroid impact 
could happen at any time, anywhere on Earth. 
If you see a bright flash in the sky, do not panic; 
stay away from the windows; and find a secure 
spot to hide. If possible, use your mobile phone 
to take pictures and videos, and time the inter- 
val between the flash and the sonic boom (it 
may take up to a few minutes). Without your 
observations, a future impact event might 
remain as enigmatic as that at Tunguska.
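
The timing advice can be turned into a crude distance estimate, as in the sketch below. The flash arrives essentially instantly, so the delay is the travel time of the sound. The speed of 340 m/s is an assumed near-surface value; sound from a high-altitude airburst crosses colder, thinner air, so a proper reconstruction would use an atmospheric profile rather than a single number.

```python
SPEED_OF_SOUND_M_S = 340.0  # assumed near-surface value; varies with altitude and temperature


def distance_km(delay_s, sound_speed_m_s=SPEED_OF_SOUND_M_S):
    """Rough distance to the source from the flash-to-boom delay in seconds."""
    if delay_s < 0:
        raise ValueError("delay must be non-negative")
    return delay_s * sound_speed_m_s / 1000.0


for delay in (30, 90, 180):
    print(f"{delay:>3d} s delay -> roughly {distance_km(delay):.0f} km away")
```

A delay of a few minutes therefore corresponds to a source several tens of kilometres away, which is also why there is time to move away from windows after the flash.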

STEM CELLS


Differentiated cells 
in a back-up role


Two independent studies show that, if push comes to shove, differentiated cells 
of the stomach and lung can act as adult stem cells, generating various cell 
types of the tissues, including a pool of stem cells. SEE ARTICLE P.218 


TUSHAR J. DESAI & MARK A. KRASNOW 


When functional cells die, they are
soon replaced. In most cases,
the replacement cells arise either
from the division of surviving mature cells
of the same class or from the division and 
differentiation of tissue stem cells. But what 
happens when resident stem cells are selectively
depleted? Two papers, one by Tata
et al. on page 218 of this issue and the other
by Stange et al. published in Cell, find that
following depletion of stem cells in the stomach 
or lung, stem-cell function can be recov- 
ered through a surprising back-up function 
provided by specific differentiated cells in 
each tissue. 

The stomach and lung are lined by a single 
layer of cells of various types that are continu- 
ally replaced throughout life as they become 
damaged or die. In the central part of the 
stomach, the cells lining periodic invaginations
called crypts are found in a specific distribu- 
tion, organized by class (Fig. 1a). It is thought 
that cells in individual crypts are maintained 
by rapidly dividing tissue stem cells that reside 
just above the crypt’s midpoint and whose 
daughter cells spread in both directions, 
differentiating into cells of various classes.

Stange and colleagues, working in mice, 
find that the base of the crypts contains cells 
that express Troy, a marker of intestinal stem 
cells. Using genetic techniques to ‘pulse-label’
these cells in a permanent and heritable
manner at a low frequency, they occasionally 
find crypts in which all cells are derived from 
a Troy-expressing cell whose progeny slowly 
spread up from the base. When the authors 
destroyed the tissue stem cells, however, Troy- 
expressing cells executed this stem-cell func- 
tion much more rapidly and in many more 
crypts. Remarkably, the Troy-expressing cells 
are a type of fully mature secretory cell called 
a chief cell, which maintains its differentiated 
identity even while performing its stem-cell 
function. Because their regenerative func- 
tion is activated following depletion of the 
tissue stem-cell population, chief cells can be 
considered reserve stem cells. 

Tata et al. independently demonstrate that 
differentiated airway secretory cells known as 
Clara cells can contribute to regeneration in 
the lung. Previous work showed that undifferentiated
basal cells in the mouse trachea
replenish the stock of secretory and multi-ciliated
cells, which produce and clear airway
mucus, respectively. In the present paper, the
investigators pulse-labelled mature secretory 
cells en masse before specifically killing basal 
cells. Surprisingly, they later found the lineage 
mark they had introduced before basal-cell 
destruction in newly arising basal cells. But 
Tata and co-workers’ bulk-labelling strategy 
is a potential caveat, because it may have inad- 
vertently marked some original basal cells 
that escaped destruction. It would be valuable 
to conduct studies using a sparse-labelling 
strategy, to trace the behaviour of individual 
secretory cells. 

These authors go on to show that the marked 
basal cells, presumably descendants of labelled 
mature secretory cells, function as stem cells, 
renewing both multi-ciliated and secretory 
cell types (Fig. 1b). Because their progenitor 
activity is elicited only after elimination of 
basal stem cells, tracheal Clara cells can also 
be considered reserve stem cells. 

Although the differentiated Clara cells of the 
lung and chief cells of the stomach each give 
rise to multiple cell types, the routes they take 
are different. Clara cells generate replacement 
stem cells, whereas chief cells apparently bypass 
this requirement and are themselves stem cells. 
However, lower down in the airway tract, Clara 
cells seem to be stem cells, renewing themselves 
and multi-ciliated cells without first becoming 
basal cells. Conversely, chief cells also seem to
generate stem cells, albeit indirectly, because 
their descendants eventually replace all crypt 
cells, including the resident stem-cell popu- 
lations. Thus, despite taking different routes, 
these mature cells share the potential to gener- 
ate both differentiated cells and stem cells. 

The two papers challenge the primacy of
undifferentiated, resident stem cells, given
that mature cells can substitute for their function
and even make new ones. In other tissues,
differentiated cells may similarly provide a
reserve stem-cell function when the primary
renewal mechanisms are inadequate. The
new studies also raise questions, such as what
reprogramming factors regulate stem-cell
behaviour in mature cells, and whether reversion
to an undifferentiated state is an obligate
step. Also, which cells generate the primary
stem-cell population in a tissue? And how
is an appropriate balance between mature
cells and different types of stem cells within a
tissue maintained? Tata et al. provide evidence
that, in the trachea, contact between Clara
cells and basal cells or short-range inhibition
of Clara-cell dedifferentiation by basal cells
may play a part.

Figure 1 | When mature cells function as reserve stem cells. a, In the crypts of the stomach, undifferentiated adult stem cells (such as the orange cell) occupy
the stem-cell zone, whereas functional, enzyme-secreting chief cells (blue) reside at the base. During cellular turnover, stem cells generate all cell types of the
crypt (orange). But Stange et al. find that if stem cells are destroyed, mature chief cells assume a stem-cell role, renewing even the depleted stem cells. Chief cells
infrequently renew crypts in the absence of obvious injury (dotted arrow). b, The adult tracheal stem cells called basal cells (orange) replenish the complement
of secretory Clara cells and multi-ciliated cells under normal physiological conditions and after injury. When Tata et al. destroyed these stem cells, Clara cells
became activated to regenerate basal cells, which resumed the task of maintaining tracheal cell types.

The pursuit of these questions may have
implications for regenerative medicine, given
that there is an intrinsic appeal to the shorter
path in redirecting differentiation of a mature
cell instead of starting from scratch with an
undifferentiated stem cell. Equally important
is the possibility that these ‘reserve’ programs
can be activated in differentiated cells in vivo
by extrinsic signals. This would eliminate the
need to introduce cellular reprogramming
factors, and thereby avoid the attendant risk
of promoting cancer through this form of
potential therapy.


QUANTUM PHYSICS


The right ambience
for a single spin


Long-lived single electron spins are crucial for quantum computation and for 
understanding spin dynamics. A remarkably long lifetime — of the order of 
minutes — has now been obtained for a solid-state system. SEE LETTER P.242 


MICHAEL E. FLATTÉ


Anyone who has had a pleasant dinner at
a favourite restaurant ruined by noisy 
neighbours understands the disrup- 
tion caused by too much interaction with 
one’s environment. Most electronic spins in 
a solid are also buffeted by myriad naturally 
occurring ‘noises’, including nearby fluctuating
electronic motion (spin-orbit interactions),
interactions with other electronic or
nuclear spins, and the mechanical motion of 
ions. The resilience of single-spin dynamics 
to these environmental effects is quantified by 
the spin coherence time or the closely related 
zero-field spin lifetime. Just as one might make 
modifications to soundproof a restaurant to 
improve the ambience for diners, so reducing 
the noises influencing a single spin by lower- 
ing the temperature, eliminating nuclear spins, 
and choosing solids made up of light atoms 
that have weak spin-orbit interactions, leads 
to long spin coherence times. Unfortunately, 
these methods also limit the materials in which 
long spin coherence times can be observed. But 
a deaf diner is impervious to noisy neighbours, 
and on page 242 of this issue, Miyamachi 
et al. demonstrate an approach to making
a single spin deaf to the dominant noises 
around it. 
The system studied by Miyamachi and 
colleagues is a single holmium (Ho) atom 
adsorbed on the surface of platinum (Pt). The
Ho atom has an electronic spin of 8, and its
lowest-energy spin states correspond to the 
spin pointing towards the surface or away from 
the surface; these two spin states are degenerate 
(of equal energy). A dominant source of noise 
for the electronic spin of an atom adsorbed on 
the surface of a metal comes from a passing 
conduction electronic spin, which interacts 
with the adsorbed spin and changes its orien- 
tation, transferring one quantum of angular 
momentum. When this occurs, the spin ori- 
entation of the adsorbed spin also changes, so 
the rate of this process can limit the coherence 
time of the adsorbed spin. 

Lengthening the spin lifetime by reducing 
the interaction with the environment has 
been demonstrated for single spins on metals,
by building an insulating barrier between
the adsorbed spin and the metal underneath.
In Miyamachi and colleagues’ experiment, 
the Ho atom is adsorbed directly on a Pt 
surface (technically known as the (111) sur- 
face) chosen so that all the Pt surface atoms 
are arranged in regular, repeating equilateral 
triangles. The Ho atom sits in the centre of one 
of those triangles, and from its vantage point 
the surface would look the same if the entire 
surface were rotated 120° around it. Ordinarily, 
the presence of these neighbouring Pt atoms, 
combined with the spin-orbit interaction, 
would push the adsorbed spin into a quantum- 
mechanical state that is a superposition of the 
two low-energy (up and down) states, corresponding
to a non-degenerate ground state for
the Ho atom’s electronic spin that has a vanish- 
ingly small spin orientation, and thus a short 
spin lifetime. 

Figure 1 | Energy-level structure of spin systems. The energy splitting between the two degenerate
(equal energy) spin states and a third spin state for a holmium (Ho) atom on a platinum (Pt) surface is
three orders of magnitude larger than for a nitrogen vacancy (NV⁻) centre in diamond. Furthermore, the
two degenerate states are the ground states of the system, whereas the ground state of the NV centre is a
single state. This energy-level structure and large energy splitting for Ho on Pt was shown by Miyamachi
et al., for temperatures corresponding to energies much less than the splitting, to eliminate spin processes
that would reduce the lifetime of the electronic spin of the Ho atom.

Here, however, it is this three-fold symmetry
(so called because three rotations of 120° bring
the surface back to its original configuration)
that deafens the spin to its surroundings. The
authors showed that for this geometric position
of the Ho atom, and for the Ho atom’s
spin of 8, a transition from the Ho atom spin
pointing away from the surface to it pointing
towards the surface is not caused by the
presence of the neighbouring atoms, and also
cannot directly occur through an interaction
with a passing electron’s spin; the Ho atom
spin is oblivious to those interactions.

The transition remains possible with two 
spin-flips with passing electrons, but this 
requires the Ho spin to be in an intermedi- 
ate orientation between the two spin-flips, 
and being in such an orientation costs a lot of 
energy. For Ho on Pt(111), the most impor- 
tant intermediate orientation is about 8 milli- 
electronvolts (or, equivalently, 100 kelvin) 
above the ground state (Fig. 1). Thus, if the 
temperature is much less than 100 K, the two- 
spin-flip transition is extremely unlikely. As a 
result, at a temperature of 1.1 K, Miyamachi 
and colleagues measured a spin lifetime that 
exceeded 6 minutes — a remarkably long value 
for any solid-state spin system. 
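
The equivalence between the energy gap and a temperature, and the reason the two-spin-flip route is frozen out at low temperature, can be illustrated with a short calculation. The sketch below converts the 8 meV gap to kelvin and evaluates the Boltzmann factor exp(-E/kT) at a few temperatures. Using the Boltzmann factor as a proxy for how often the intermediate orientation is thermally reached is a simplifying assumption made here, not the rate calculation performed by the authors.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in electronvolts per kelvin


def energy_to_kelvin(energy_ev):
    """Temperature at which the thermal energy k_B * T equals the given energy."""
    return energy_ev / K_B_EV_PER_K


def boltzmann_factor(energy_ev, temperature_k):
    """Relative thermal weight exp(-E / (k_B * T)) of a state at energy E."""
    return math.exp(-energy_ev / (K_B_EV_PER_K * temperature_k))


GAP_EV = 8e-3  # intermediate orientation of Ho on Pt(111), from the text

print(f"8 meV corresponds to about {energy_to_kelvin(GAP_EV):.0f} K")
for temperature in (1.1, 10.0, 100.0):
    weight = boltzmann_factor(GAP_EV, temperature)
    print(f"T = {temperature:6.1f} K -> Boltzmann factor {weight:.2e}")
```

The gap works out at a little over 90 K, consistent with the rounded figure of 100 K, and at 1.1 K the Boltzmann factor is smaller than 10⁻³⁰, so thermal access to the intermediate orientation is negligible.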

How does the lifetime of the Ho electronic 
spin compare with that of single spins in semi- 
conductors? Can a similar energy-level struc- 
ture be obtained, with the spin-flip of a passing 
electron unable to cause a transition between 
the two degenerate lowest-energy states? This 
configuration occurs for a manganese atom 
doped into gallium arsenide. In the bulk of 
gallium arsenide, a manganese atom has a 
three-fold (angular momentum 1) degenerate 
ground state, but the application of an electric 
field or a strain field can push one state much
higher in energy, producing a spin-state level 
structure that should have long coherence 
times. Spin coherence times for manganese 
in gallium arsenide, however, do not exceed 
10 nanoseconds because of interactions with
nuclear spins. 

Another interesting comparison is with 
an electronic spin system called the nitrogen
vacancy (NV⁻) centre in diamond, in
which a nitrogen atom and a vacancy replace 
two neighbouring carbons. That spin centre 
also has three-fold symmetry, and has two 
degenerate states. Therefore, the fundamental 
energy-level structure looks similar to that of 
Ho on Pt(111). The degenerate states of the 
NV centre, however, are not ground states; 
the ground state is a single state and is split 
in energy from the other two states by only 
5.6 microelectronvolts (Fig. 1). Thus, to keep
long spin coherence times (a few milliseconds
at room temperature), the spin-orbit interaction
must be very small.

Miyamachi and colleagues’ exceptionally 
long lifetimes for single spins adsorbed on 
a metal strongly support the view that new 
single-spin candidates with improved fun- 
damental properties can still be found, by 
careful consideration of the geometry and 
symmetry of the single spin within its envi- 
ronment. For a long-lived single spin, the 
effective coherence time can also be dramatically
enhanced by careful selection and
application of pulses of radiation. To take
full advantage of the long spin lifetimes of
Ho on Pt(111) for spin-based computation or
temporary information storage, high-speed
manipulation techniques must be developed
for controlling the single-spin dynamics, such
as have been demonstrated for single spins
in diamond.

The ecological deficit


The almost complete extinction of small mammals in forest islands within 
25 years of the construction of a reservoir that fragmented the habitat provides 
a striking example of delayed biodiversity loss. 


ANDREW GONZALEZ 


Global rates of extinction are set to peak
this century. But it is often difficult to
predict when and where species losses
will occur, and this hampers efforts to prevent
them. Writing in Science, Gibson et al. provide
an example of things to come, identifying
a combination of habitat fragmentation and 
the arrival of an invasive species as the cause 
of the extinction of an entire community of 
small mammals. The extinction sequence was 
revealed by repeated surveys of the mamma- 
lian fauna inhabiting a set of tropical forest 
islands in southern Thailand that were isolated 
in 1986 following inundation of the forest to 
create the Chiew Larn reservoir. A mathematical
model of the biogeography of these islands
allowed the authors to estimate the extinction 
half-life of the mammal communities to be 
around 14 years — a striking example of an 
extinction debt. 
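
To give a feel for what a half-life of about 14 years implies, the sketch below applies simple exponential decay to a starting community. It is only an illustration of the half-life arithmetic, not the authors' biogeographic model, and the starting count of 12 species is borrowed from the mainland figure reported in the study.

```python
def fraction_remaining(years, half_life_years=14.0):
    """Fraction of the original species still present after `years`."""
    return 0.5 ** (years / half_life_years)


def expected_species(initial_species, years, half_life_years=14.0):
    """Expected species count under pure exponential decay."""
    return initial_species * fraction_remaining(years, half_life_years)


for years in (5, 14, 25, 50):
    remaining = expected_species(12, years)
    print(f"after {years:>2d} years: about {remaining:.1f} of 12 species remain")
```

With these numbers, roughly 29% of the starting community would be expected to remain after 25 years, so a pure half-life calculation gives only the general scale of the loss.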

Extinction debt is the idea that species may 
continue to be lost long after an initial disturbance.
Decades of research suggest that
small islands of habitat cannot sustain the 
rich diversity of species they harboured at 
the time of isolation. Instead, diversity on 
habitat islands decays exponentially over time 
in a process called community relaxation.
The number of species lost from the start
to the end of the relaxation process is the
size of the extinction debt (Fig. 1).

Figure 1 | Isolation leads to extinction. Gibson et al. document that within 25 years of the creation
of islands by the construction of the Chiew Larn reservoir in Thailand, the diversity of native small
mammals on the islands had decreased from up to 12 species to just one. On most of the islands, the
only remaining small mammal was the invasive species Rattus tiomanicus, which the authors suggest
contributed to the extinctions of native species.

According to theory, the rate at which an
extinction debt is paid depends on the balance
between the rate of extinction and the rate of
immigration of species to habitat islands. In a
newly isolated forest island, the rate of immigration
may be so low that it cannot compensate
for the high rate of local extinction.
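
The balance described here can be sketched with a deliberately simple linear model in the spirit of island-biogeography theory: species colonize at a per-species rate from a fixed pool and go locally extinct at a per-species rate. The model form and the immigration rates are assumptions chosen for illustration, and only the extinction rate is tied to the text, being set to match a 14-year half-life. This is not the model used by Gibson and colleagues.

```python
import math


def species_over_time(s0, pool, immigration, extinction, years):
    """Closed-form solution of dS/dt = immigration*(pool - S) - extinction*S."""
    total = immigration + extinction
    if total <= 0:
        return float(s0)  # no turnover at all: richness stays constant
    s_equilibrium = immigration * pool / total
    return s_equilibrium + (s0 - s_equilibrium) * math.exp(-total * years)


EXTINCTION_PER_YEAR = math.log(2) / 14.0  # matches a 14-year half-life
POOL = 12  # mainland species pool, from the figures reported in the text

# Hypothetical immigration rates: none, weak and strong recolonization.
for immigration in (0.0, 0.005, 0.05):
    richness = species_over_time(POOL, POOL, immigration, EXTINCTION_PER_YEAR, 25)
    print(f"immigration {immigration:.3f}/yr -> {richness:.1f} species after 25 years")
```

In this toy model richness settles at pool × immigration / (immigration + extinction), so when immigration is close to zero the community decays almost to nothing, whereas comparable immigration and extinction rates hold richness near half the pool.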

Gibson and colleagues compared the num- 
ber of mammalian species on 16 islands in the 
Chiew Larn reservoir with the biodiversity at 
sites on the nearby mainland, which contained 
up to 12 species of small mammal. They found 
that, after only five years of isolation, the small- 
est islands (those less than 10 hectares in area) 
harboured one to three native species, and that 
after 25 years even the largest islands (up to 
56 ha) had fewer than five species. As expected,
they found that the smaller the habitat island,
the faster the loss of species.

The authors show that the effects of for- 
est fragmentation were exacerbated by an 
invasive species. They identify the Malayan 
field rat (Rattus tiomanicus) as the culprit 
because on many of the islands it was the 
only small mammal remaining. This species 
is not naturally found in undisturbed tropi- 
cal forests in this region, but it is common in 
human-transformed environments, such as 
agricultural fields and villages. When R. tio- 
manicus is common it often has devastating 
effects on native fauna. The authors believe 
that it may have spread to the islands after the 
area was inundated, and so was a major factor 
that accentuated the extinction-immigration 
imbalance and contributed to the eventual 
collapse of the island communities. 

This study is exceptional because it com- 
bines field observations with an insightful 
mathematical model to reveal the process of 
biodiversity loss. It builds on long-term con- 
trolled experiments on several continents that 
revealed that habitat fragmentation causes 
extinction debts. We now have accumulated
evidence from field surveys and experiments 
that fragmentation has a lasting legacy. But 
further research is needed to examine how 
different drivers of extinction — such as cli- 
mate change and exotic species — interact with 
habitat fragmentation to affect the rate and 
extent of an extinction debt. If we are to make 
accurate predictions of biodiversity loss, we 
must update extinction theory to account for 
the many effects humans have on biodiversity.

An important but often overlooked con- 
sequence of an extinction debt is the impact 
it can have on ecosystem functions and services.
This parallel idea of an ecosystem-function
debt arises from research showing that
biodiversity influences the way ecosystems 
function. Consider, for example, the delayed 
effects of forest fragmentation on the diversity 
of wild pollinators and the economic legacy of 
a declining pollination service on agriculture 
and wild plant populations. Or consider as
well that habitat fragmentation may ultimately
result in the emergence of novel diseases carried
by animals that can successfully invade a
landscape only after substantial extinction has
taken place. These ecosystem impacts connect
the extinction debt to human well-being.

So what can be done to minimize the causes 
and consequences of extinction debts? In a 
landscape composed of many habitat frag- 
ments, the creation of a reserve network can 
help to rebalance local extinction with immi- 
gration. Network science suggests that the size 
and connectivity of reserve networks can be 
designed to optimally maintain biodiversity
and sustain ecosystem services in the long 
term, while taking into account real-world 
economic constraints. In some cases, habitat 
connectivity can be restored by providing cor- 
ridors to connect habitats; in others, creating 
close clusters of habitat fragments can achieve 
the same goal. Engineering the configuration 
of landscapes to minimize the effects of extinc- 
tion debts is not the stuff of science fiction, but 
much more research is needed before we will 
know how best to do it. 

Understanding extinction debts will 
improve our ability to predict when biodiversity
loss will peak this century. Gibson and colleagues
have provided a powerful and poignant
example of the science underlying this process,
and their findings serve as a reminder that
many extinctions are still to come.

Antibodies advance
the search for a cure 


Efforts to make a prophylactic HIV vaccine have identified monoclonal antibodies 
that potently suppress viral replication. Studies in monkeys show that these 
reagents effectively treat HIV infection. SEE ARTICLE P.224 & LETTER P.277 


LOUIS J. PICKER & STEVEN G. DEEKS 


A major breakthrough from the HIV-vaccine
research community in recent
years was the isolation and characterization
of novel antibodies from HIV-infected
people that have the remarkable ability to
efficiently neutralize most circulating HIV
strains. The antibodies’ mechanism of action
involves the recognition and blockade of evolu- 
tionarily conserved, functionally crucial struc- 
tures of the HIV viral envelope. These unusual 
antibodies have reinvigorated the effort to 
develop an antibody-based prophylactic HIV 
vaccine by defining effective human antibody 
responses to the virus and providing a ‘map’ for
reverse engineering of vaccines that recapitulate
these responses. Because these antibodies
develop infrequently, emerge only after many
years of HIV infection and are characterized
by a high degree of mutation, this task is not
likely to be quickly accomplished. But that
does not necessarily mean that the clinical
benefit of these antibodies is relegated to the
distant future. On the contrary, two reports in
this issue, by Barouch et al. (page 224) and
Shingai et al. (page 277), demonstrate that
combinations of such antibodies drastically 
reduce virus levels in chronically infected rhe- 
sus macaques, bolstering the hope that such 
therapies might be effective in humans. 

The two groups of investigators examined
the ability of several of these potent and broadly
HIV-neutralizing human monoclonal antibodies
(mAbs) to suppress viraemia (the
presence of the virus in the bloodstream) in
rhesus macaques chronically infected with
pathogenic (AIDS-causing) simian–human
immunodeficiency virus (SHIV) hybrids
that use the HIV envelope for cell entry. Their
findings were remarkably consistent: administration
of cocktails of two or more mAbs
(directed at two different structural regions,
the CD4-binding site and V3-loop/N332
glycans) profoundly suppressed viraemia, often
to below the level of detection. This suppression
persisted for weeks, as long as mAb levels
were maintained. Furthermore, mutations
associated with resistance to the neutralization
activity of the administered mAbs were not
generally observed in the virus that rebounded
after mAb levels waned. Remarkably, Barouch
and colleagues show that one antibody, the
extraordinarily potent N332-glycan-dependent
PGT121 mAb, was able to provide prolonged
viral suppression when used alone.

Figure 1 | Differences between conventional ART and mAb therapy. a, Conventional antiretroviral
therapy (ART) involves combinations of small-molecule drugs that act at various cellular sites, protecting
uninfected target CD4⁺ T cells from HIV infection by inhibiting viral entry or post-entry viral function.
But this treatment does not affect the production of virus by infected cells, nor facilitate the clearance of
virus particles. Persistent virus-producing cells and circulating virus can reignite progressive infection
if ART is discontinued. Free virus can also activate cells, contributing to a chronic inflammatory
environment. b, Monoclonal antibodies (mAbs) that bind to and inactivate viral envelope proteins
effectively neutralize the virus and therefore also prevent the spread of infection. These may have the
added advantage of facilitating the clearance of virus particles and killing virus-producing cells by
antibody-dependent mechanisms, in particular antibody-dependent cell-mediated cytotoxicity (ADCC),
thereby potentially diminishing the viral reservoir and preventing local immune activation.

Given that current antiretroviral therapy 
(ART), which is based on small molecules that 
target different phases of the HIV life cycle, 
is effective and can be given orally, whereas
mAbs need to be delivered by injection, why is 
this approach of interest? The answer lies in the 
different mechanisms of viral suppression by 
conventional ART versus neutralizing mAbs. 
Current ART drugs prevent the virus from 
spreading from an infected cell to the next 
round of target cells, but they do not directly 
kill infected cells nor inhibit the cells’ production
of virus particles (Fig. 1a). In the absence
of viral spread, virus production declines as 
infected cells die by virus- or immune-medi- 
ated killing or by programmed cell death. But 
some infected cells persist and produce virus 
particles indefinitely. This rare but enduring 
reservoir of virus-producing cells is the source 
of viral rebound when people stop ART, and
probably contributes to persistent inflammation
and immune dysfunction (and associated
morbidities) in individuals on ART.

Treatment with neutralizing mAbs would, 
like conventional ART, prevent viral spread, 
and this undoubtedly accounts for much of 
the antibodies’ observed activity in these two 
studies. However, mAbs have intrinsic effector 
functions that would not only facilitate rapid 
clearance of virus particles produced by 
residually infected cells, but might also accel- 
erate the destruction of virus-producing cells 
by antibody-dependent cell-mediated cyto- 
toxicity, complement lysis or phagocytosis 
(Fig. 1b). Although these processes were not 
directly demonstrated in either of the new 
studies, Barouch et al. show accelerated viral 
clearance in mAb-treated macaques com- 
pared with that seen in humans given con- 
ventional ART. They also provide evidence for 
declining levels of cell-associated HIV DNA 
(a measurement of the number of infected 
cells) and improved function of SHIV-specific 
CD8+ T cells (antiviral immune cells) in mAb-treated
animals. Perhaps more interestingly,
Barouch and colleagues report that the viral- 
load set points — the stable level of viraemia 
reached once a balance is established between 
viral replication and the immune system — 
of the animals after therapy cessation were 
consistently lower than before treatment, 
suggesting that mAb treatment enhanced
the antiviral immune response.

There are several caveats to this exciting 
work that deserve mention. One is the substan- 
tial animal-to-animal variability in pharmaco- 
kinetic outcomes, which would have to be 
addressed if long-term administration of these
antibodies were pursued in people. Another is 
the fact that the high mutability of HIV means 
that slightly different envelope proteins will 
be expressed by virus strains both within an 
individual and across populations, making it 
likely that some people will harbour viruses 
that are resistant to one or more of the thera- 
peutic mAbs. In addition, whether the mAbs 
can cross the blood-brain barrier and suppress 
viral replication in the central nervous system 
remains an open question. Finally, the extent to 
which viral envelope protein is exposed on the 
surface of long-lived infected cells is unknown. 
Additional treatment with virus-activating 
agents will probably be needed for antibody- 
mediated destruction to have a meaningful 
impact on this reservoir. 

Despite these limitations, the findings of 
these two papers could revolutionize efforts 
to cure HIV. The combination of conventional 
ART and neutralizing mAb therapy might 
reduce viral replication more effectively than 
conventional ART alone, mediate destruction 
of virus-producing cells arising from latent 
infection, diminish deleterious, generalized 
immune activation, and facilitate the effi- 
cacy of T-cell-targeted therapeutic vaccines 
intended to destroy latent cells (ref. 8). At the very
least, these results will catalyse collaborations 
between the massive teams of experts who 
have for decades worked on HIV prevention 
and treatment in separate venues. As previously
emphasized (refs 9, 10), true multidisciplinary
science will be needed if we are to generate a
cure for HIV.


Louis J. Picker is at the Vaccine and Gene
Therapy Institute, Oregon Health & Science
University, Beaverton, Oregon 97006, USA.
Steven G. Deeks is in the Department
of Medicine, University of California,
San Francisco, San Francisco, California
94110, USA.

e-mails: pickerl@ohsu.edu; 
sdeeks@php.uscf.edu 


1. Mascola, J. R. & Haynes, B. F. Immunol. Rev. 254, 225-244 (2013).
2. Klein, F. et al. Science 341, 1199-1204 (2013).
3. Barouch, D. H. et al. Nature 503, 224-228 (2013).
4. Shingai, M. et al. Nature 503, 277-280 (2013).
5. Moldt, B. et al. Proc. Natl Acad. Sci. USA 109, 18921-18925 (2012).
6. Volberding, P. A. & Deeks, S. G. Lancet 376, 49-62 (2010).
7. Palmer, S. et al. Proc. Natl Acad. Sci. USA 105, 3879-3884 (2008).
8. Deeks, S. G. et al. Nature Rev. Immunol. 12, 607-614 (2012).
9. Deeks, S., Drosten, C., Picker, L., Subbarao, K. & Suzich, J. Nature Med. 19, 30-34 (2013).
10. Richman, D. D. et al. Science 323, 1304-1307 (2009).




This article was published online on 30 October 2013. 


REVIEW 


doi:10.1038/nature12608 


Sound and heat revolutions in phononics 


Martin Maldovan


The phonon is the physical particle representing mechanical vibration and is responsible for the transmission of 
everyday sound and heat. Understanding and controlling the phononic properties of materials provides opportunities 
to thermally insulate buildings, reduce environmental noise, transform waste heat into electricity and develop 
earthquake protection. Here I review recent progress and the development of new ideas and devices that make use of 
phononic properties to control both sound and heat. Advances in sonic and thermal diodes, optomechanical crystals, 
acoustic and thermal cloaking, hypersonic phononic crystals, thermoelectrics, and thermocrystals herald the next
technological revolution in phononics.


In recent decades, major technological revolutions have transformed
our society and daily lives. Their remarkable innovations have been
based primarily on our improved ability to manipulate two particles:
electrons and photons. In particular, the control of electrons in
semiconductor materials has generated fundamental changes, with laptops,
mobile telephones and digital cameras now products that seem always to
have existed. Analogously, the development of materials and devices
with which to control photons has generated major changes in society,
such as wireless communication and the use of optical fibres and
microwaves. The successful management of the electromagnetic spectrum is
demonstrated by the wide range of frequencies controlled in
electromagnetic devices, which extends over 14 orders of magnitude, ranging
from positron emission tomography (PET) scanning at frequency
f ~ 10²⁰ Hz to amplitude-modulation (AM) radios at f ~ 10⁶ Hz.

Besides electrons and photons, another everyday particle is the phonon,
which is responsible for the transmission of sound and heat. Given the
many applications of our remarkable success in managing electrons and
photons, it would be valuable to achieve a similar degree of control over
the particle that accounts for both sound and heat (Fig. 1). Although some
sonic and thermal devices and materials, such as medical ultrasound
imaging machines and thermal insulation materials, are well known,
further applications for phononic devices are developing fast, from thin
acoustic metamaterials that can soundproof rooms to enhanced
thermoelectric devices that can use our bodies’ waste heat to power portable
electronic devices.

In this review, I provide insights on newly developed phononic materials
and devices that have recently been shown to have remarkable physical
properties, making it possible to manipulate sound and heat with
unprecedented precision. I begin with a classical description of small-scale
phononic crystals, followed by recent progress on unidirectional sound
transmission in acoustic diodes (analogous to electronic diodes) and
new developments on acoustic cloaking using metamaterials. I examine
novel interactions between sound and light in resonant cavities, and
discuss new advances in thermal diodes and thermal metamaterials. I
also provide an overview on thermoelectric materials (which convert heat
into electricity) and discuss novel thermocrystals, which treat heat like
sound and bring phonon management to the boundary zone between
sound and heat. These recent ideas and concepts have improved our
ability to manage the phononic spectrum, providing exciting opportunities
to develop a large variety of devices that can control sound and heat.

[Figure 1: the phononic spectrum. Frequency axis (Hz) with tick marks from 1 to 10⁹ Hz and markers at 1 Hz, 1 kHz, 1 MHz and 1 GHz; labelled bands include infrasound and music.]


Hypersound and heat control by phononic crystals 
Sound and heat can both be described as mechanical vibrations trans- 
mitted through the atomic lattice. One difference between them, how- 
ever, is that most sound waves oscillate at low frequencies (kilohertz) 
and propagate over large distances, whereas most heat vibrations oscil- 
late at high frequencies (terahertz) and travel small distances. These 
different features lead scientists and engineers to employ different strat- 
egies to control sound and heat propagation. Essentially, macroscale and 
microstructured materials are able to manipulate sound and hypersound 
(very high-frequency sound, with f~ 1 GHz) frequencies, whereas to 
control heat, nanostructures are generally required. The development 
of sonic and thermal devices thus requires the design, fabrication and 
characterization of composite materials ranging from the centimetre 
scale to the nanometre scale. 

Over the past two decades, the propagation of sound waves with 
frequencies in the range from kilohertz to megahertz has been efficiently 
controlled by phononic crystals—artificial periodic structures made of 
two elastic materials (refs 1, 2; Fig. 2). Although these structures were initially
designed to control sound, in recent years small-scale phononic crystals 
have been successfully used to control hypersound and heat. In phono- 
nic crystals, mechanical waves with frequencies within a specific range 
are not allowed to propagate within the periodic structure. This forbidden
frequency range, the ‘phononic bandgap’ (PBG), allows sound to
be controlled in many useful ways in structures that can act as sonic filters,
isolators, waveguides or resonant cavities. Owing to wave interference
effects, PBGs occur for phonon wavelengths λ that are comparable to the
structure periodicity a (Fig. 2). For experimental convenience, a large
number of phononic crystals have been fabricated at macroscopic scales
(a ~ 10 cm to 1 mm) to control sound of frequency f ~ 10³ Hz and
ultrasound of f ~ 10⁶ Hz (refs 3-7), finding promising applications in
acoustics, medical diagnosis and remote sensing. In recent years, however, the
challenge has been to fabricate phononic crystals having small
periodicities a < 1 μm, to control high-frequency (f ~ 10⁹-10¹² Hz) phonons (refs 8-11).
Given that the periodicity of small-scale (a ~ 1 μm) phononic crystals
can be comparable to the wavelengths of light, these crystals can provide
bandgaps for both hypersound and light, suggesting new mechanisms
with which to enhance phonon-photon interactions (ref. 12). By additionally
reducing the periodicity to the nanometre scale, the applications of small- 
scale phononic crystals can be extended from sound to heat, because 
PBGs could in principle exist in the terahertz regime. As shown later, 
however, this reasoning requires a more precise analysis owing to the 
different characteristics of sound and heat transfer. 
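
As a rough illustration of why the periodicity a sets the frequency scale of the bandgap, the sketch below estimates a gap-centre frequency from the Bragg condition λ ≈ 2a, so that f ≈ c/(2a). The Bragg rule of thumb, the function names and the sound speeds (air, water and a generic polymer) are assumptions chosen for illustration; they are not taken from the cited experiments.

```python
import math


def bragg_frequency(sound_speed_m_s, periodicity_m):
    """Estimate a gap-centre frequency in Hz, assuming the Bragg condition lambda = 2a."""
    return sound_speed_m_s / (2.0 * periodicity_m)


def order_of_magnitude(frequency_hz):
    """Nearest power of ten of a frequency."""
    return round(math.log10(frequency_hz))


# (label, assumed sound speed in m/s, periodicity a in m)
EXAMPLES = [
    ("sound, a = 10 cm in air", 340.0, 0.10),
    ("ultrasound, a = 1 mm in water", 1480.0, 1.0e-3),
    ("hypersound, a = 1 um in a polymer", 2500.0, 1.0e-6),
]

if __name__ == "__main__":
    for label, speed, a in EXAMPLES:
        f = bragg_frequency(speed, a)
        print(f"{label}: f ~ 10^{order_of_magnitude(f)} Hz")
```

Under these assumptions the three periodicities land near 10³, 10⁶ and 10⁹ Hz, matching the sound, ultrasound and hypersound scales quoted in the text.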

A small-scale phononic crystal for hypersound frequencies f ~ 1 GHz
has been fabricated using interference lithography (ref. 8). This technique
provides single-crystalline periodic structures by transferring a laser
intensity pattern into a photosensitive material. The phononic crystal consists
of a triangular array of air holes in an epoxy matrix, with lattice periodicity
a = 1.36 μm (Fig. 2c). To characterize the phonon modes, Brillouin light
scattering is employed, which provides complete phononic band
diagrams for the dispersion relation for phonon modes propagating within
the crystal (Fig. 2). Other small-scale phononic crystals were constructed
by self-assembling single-crystalline colloidal films of polystyrene spheres
into a face-centred-cubic lattice (ref. 9). The small size of the spheres used
(d = 256 nm) guarantees operation in the hypersonic frequency range.
By employing Brillouin light scattering to map the dispersion relation
of longitudinal phonons, a hypersonic (gigahertz frequency) PBG was
measured. The significance of these small-scale architectures is that, in
addition to providing simultaneous gaps for hypersound and light, they
greatly extend the use of phononic crystals from sonic and ultrasonic
frequencies to the hypersonic regime (refs 8-10). This increases the range of
phononic frequencies that can be controlled by three orders of magnitude.

More recently, a state-of-the-art nanometre-scale phononic crystal
was fabricated by transferring the patterns of two intersecting arrays of
platinum nanowires into a silicon epilayer (Fig. 2c; ref. 11). The phononic
‘nanomesh’ consists of a square-lattice silicon matrix of air holes, with
a = 34 nm. The key feature of this structure is that it reduces the thermal
conductivity of bulk silicon by almost two orders of magnitude. Heat in
silicon is mostly carried by phonons with 10¹² Hz frequencies, so this
demonstrates that reducing the size of phononic crystals to the
nanometre scale can modify and control the propagation of high-frequency
phonons, extending the applications of phononic crystals from sound
to heat (ref. 11). As I explain below, however, the mechanism behind the
reduction of the thermal conductivity is not based on coherent interference
and PBGs, as in classical phononic crystals, but is mainly due to the
diffuse scattering of phonons at interfaces.

Figure 2 | Phononic crystals. a, 1D, 2D and 3D phononic crystals made
of two different elastic materials arranged periodically. Different
colours represent materials with different elastic properties. b, An
example of a phononic band diagram ω = ω(k) for a two-dimensional
phononic crystal, in which non-dimensional frequencies ωa/2πc_T
(with c_T a transverse velocity) are plotted versus the wavevector k
along the Γ-X-M-Γ path in the square Brillouin zone. The range of
forbidden frequencies, or PBG, is shown in orange. c, 2D phononic
crystals with periodicities a in the centimetre range (left), the
micrometre range (middle) and the tens-of-nanometres range (right) can
be used to control sound, hypersound and heat, respectively. Images are
taken from fig. 1 of ref. 3 (left image, with permission), fig. 1a of
ref. 8 (middle image) and fig. 1b of ref. 11 (right image, with
permission).

With the recent progress in synthesis and characterization of small- 
scale materials, it is now possible to fabricate phononic crystals with 
periodicities ranging from centimetres to nanometres (refs 3-11). These advances
have significantly increased the ability to manage the phonon spectrum,
where the frequencies that can now be controlled extend from 10³ Hz to
10¹² Hz, a range of nine orders of magnitude. As I show in this review, the
wide range of control over phonon frequencies—comparable to the wide 
range of control over electromagnetic frequencies—allows the realization 
of many exciting devices that can manipulate phonons at all relevant 
frequencies: sound, ultrasound, hypersound and heat. 


Acoustic diodes 


In electronics, a diode is a well-known device that allows electrical current 
to pass in one direction but not in the opposite direction. This funda- 
mental principle—which led to a revolution in electronics—has recently 
been demonstrated for sound waves'*"*. The development of the acoustic 
counterpart of the electronic diode opens up exciting opportunities for 
novel applications in different research areas ranging from biomedical 
ultrasound imaging to environmental noise reduction. 

A sonic diode showing unidirectional transmission was constructed 
using a one-dimensional (1D) phononic crystal made of alternating 
layers of glass and water coupled to a nonlinear acoustic medium’*"* 
(Fig. 3a). A sound wave incident from the left, with frequency ω inside
the PBG, is partially converted within the nonlinear medium to a
secondary wave with frequency 2ω. Because ω lies in the PBG, the original
wave is reflected backwards, but the secondary wave with frequency 2ω
outside the PBG can freely pass through the structure (Fig. 3a). On the
other hand, if the sound wave is incident from the right, the wave is
completely reflected in the backward direction because ω lies inside the
PBG. As a result, a sound wave with frequency ω can pass through the
system only along the forward direction, providing unidirectional sound
transmission. In experiments", the layers were about 1 mm thick, allow- 
ing the diode to operate at ultrasound frequencies in the megahertz range. 
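
The filtering role of the layered crystal in this diode can be sketched with a standard acoustic transfer-matrix calculation at normal incidence. The example below is illustrative only: it assumes a lossless quarter-wave glass/water stack of five periods centred on 1 MHz, immersed in water, with round-number densities and sound speeds. None of these design values come from the cited experiments, and the nonlinear frequency conversion itself is not modelled. For a quarter-wave stack the centre frequency sits in the bandgap, whereas at twice that frequency every layer is half a wavelength thick and the stack is transparent.

```python
import math


def layer_matrix(freq, rho, c, d):
    """2x2 transfer matrix relating (pressure, velocity) across one layer."""
    k = 2.0 * math.pi * freq / c          # wavenumber in the layer
    z = rho * c                           # characteristic impedance
    s, co = math.sin(k * d), math.cos(k * d)
    return ((co, 1j * z * s), (1j * s / z, co))


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][n] * b[n][j] for n in range(2)) for j in range(2))
        for i in range(2)
    )


def transmission(freq, layers, z0):
    """Power transmission through the stack, with impedance z0 on both sides."""
    m = ((1.0, 0.0), (0.0, 1.0))
    for rho, c, d in layers:
        m = matmul(m, layer_matrix(freq, rho, c, d))
    t = 2.0 / (m[0][0] + m[0][1] / z0 + m[1][0] * z0 + m[1][1])
    return abs(t) ** 2


# Assumed material parameters: (density in kg/m^3, sound speed in m/s)
GLASS = (2500.0, 5600.0)
WATER = (1000.0, 1480.0)
F0 = 1.0e6                                # assumed gap-centre frequency, Hz
Z_WATER = WATER[0] * WATER[1]


def quarter_wave_stack(periods=5):
    """Glass/water layers, each a quarter wavelength thick at F0."""
    layers = []
    for _ in range(periods):
        for rho, c in (GLASS, WATER):
            layers.append((rho, c, c / (4.0 * F0)))
    return layers


if __name__ == "__main__":
    stack = quarter_wave_stack()
    print("T(f0)  =", transmission(F0, stack, Z_WATER))
    print("T(2f0) =", transmission(2.0 * F0, stack, Z_WATER))
```

With these assumed values the layers are roughly 1.4 mm (glass) and 0.4 mm (water) thick. The wave at the fundamental is almost entirely reflected, while a second harmonic generated in the nonlinear medium would pass through.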

An even more interesting approach using linear materials has been 
introduced by means of a two-dimensional (2D) phononic crystal made 
of a square lattice of steel rods in contact with a diffraction structure’® 
(Fig. 3b). The critical feature in this diode is that one of the external 
surfaces is ‘smooth’ while the other is ‘rough’. Waves incident on the 
smooth surface (reverse direction) with frequencies ω inside the PBG
(~10 kHz) are reflected backwards because they fall into the forbidden 
frequency range. In contrast, waves incident on the rough surface (for- 
ward direction) can pass through partially. Because the outgoing waves 
in the forward direction are not parallel to the incident waves, the acoustic 
energy is converted through high-order diffractions to other modes with 
different spatial frequencies to overcome the barrier imposed by the 
PBG"*. The main advantage of the linear acoustic diode is that sound 
waves do not change their frequencies as they pass through the structure. 
Also, its easy fabrication method opens up opportunities for on-chip 
planar acoustic circuits with applications in logical sound processing. 

Sound rectification has also been demonstrated in 1D granular crystals'” 
consisting of a chain of steel spheres interacting nonlinearly (Fig. 3c). 
This system allows wave propagation for frequencies lower than a certain 
cut-off ω_C. When a defect sphere is added, a localized mode with
frequency ω_D > ω_C is created around the defect. An incident wave with
ω > ω_C, and below ω_D, along the reverse direction (right to left) cannot
propagate through the crystal because of the PBG. However, if moving
along the forward direction (left to right), a wave with frequency ω
and low amplitude does not pass through but excites the vibrational
mode localized around the defect sphere. If the wave’s amplitude is
increased, its acoustic energy is converted to modes with frequencies
outside the gap (ω < ω_C) that can freely pass through the structure,
generating unidirectional transmission. 

The experimental realizations of acoustic diodes, providing unidir- 
ectional sound transmission, are major steps towards complex acoustic 
devices that could greatly increase the efficiency of ultrasound medical
systems, non-destructive testing and environmental noise control. Like
its electric counterpart, the sonic diode may produce unexpected con- 
sequences, in acoustics rather than electronics. 


Acoustic invisibility cloaks by metamaterials 


Another exciting and novel opportunity to control sound waves involves 
the design and fabrication of acoustic cloaking shells. The basic idea is to 
use a shell to guide acoustic waves around a certain object such that the 
object inside the shell becomes ‘invisible’ (undetectable by sound waves) 
(Fig. 4a). Although cloaking devices were initially introduced for elec- 
tromagnetic waves'*””, these principles have recently been extended to 
sound waves”’*’. Essentially, to obtain a cloaking device for acoustic 
waves, the cloaking shell surrounding the object must be made of an 
engineered artificial material, known as acoustic metamaterial, in which 
elastic properties such as mass density and bulk modulus are anisotropic 
and spatially dependent. 

An acoustic cloaking shell was introduced by considering the simila- 
rity between the 2D electromagnetic and acoustic wave equations”. The 
cylindrical shell is made of a metamaterial fluid with inhomogeneous 
bulk modulus and an anisotropic and inhomogeneous mass density that 
redirects pressure waves around the object (Fig. 4a). Cloaking effects have 
been extended to three dimensions by noting the isomorphism between 
the acoustic wave equation and the electrical conductivity equation”. 
Such three-dimensional (3D) cloaks are also obtained by considering 
zero scattering effects in the acoustic wave equation”. Because the 2D 
and 3D cloaking shells require complex materials, it is challenging to 
implement such metamaterials in experiments. Owing to this difficulty, 
additional theoretical approaches have been developed in which layered 
cylindrical cloaking shells are considered to obtain the required values for 
mass density and bulk modulus*”®. 

To overcome the difficulty associated with material requirements, an 
innovative approach based on the analogy between acoustic elements 
and electronic circuits has been employed to design and fabricate a 2D 
cylindrical cloak (Fig. 4b)””**. The basic idea is to convert the acoustic 
material to an analogous network of inductors L and capacitors C, where

Figure 3 | Acoustic diodes. a, An acoustic diode made of a 1D phononic
crystal (alternating layers of glass and water) coupled to a nonlinear acoustic
medium (ref. 14). A sound wave with frequency ω within the PBG can evade the
PBG, and pass through the system, only from left to right (forward
direction), owing to the change of frequency ω → 2ω in the nonlinear medium.
b, A linear acoustic diode consisting of a 2D phononic crystal (square steel rods
with d = 4 mm in a square lattice with a = 7 mm) and a diffraction structure (ref. 16).
Waves with frequency ω within the PBG cannot pass through the structure if
they are incident on the phononic crystal (reverse direction), but they can
propagate if they are incident on the diffraction structure (forward direction).
c, Sound rectification in 1D granular crystals (steel spheres with radius 9.53 mm
and mass 28.84 g) (ref. 17). A defect sphere creates sound rectification by introducing
localized states with frequencies ω_D that allow incident waves to evade the PBG
(and pass through) by changing frequencies from ω to ω′ (forward direction).

Figure 4 | Acoustic cloaking. a, A sound wave incident on a rigid cylinder
surrounded by a cloaking shell. The wave passes through the shell and 
propagates as it would have done in the absence of an obstacle. Reproduced 
from fig. 2 of ref. 25 with permission. b, A cylindrical acoustic cloak for 
ultrasound waves in water. Reproduced from fig. 2a of ref. 27 with permission. 
c, Cloaking shell for elastic waves made of concentric cylinders of PVC and 
PDMS. The coin diameter is 25.75 mm. Reproduced from fig. 2 of ref. 31 with 
permission. d, The measured pressure from a 60-kHz point source for a steel 
cylinder (grey) surrounded by the cloaking shell (light-blue coating around the 
cylinder) from b, where waves pass largely unperturbed (top). Scale bar, 2.7 cm. 
Elastic waves propagating through a cylinder (black) with the cloaking shell (dotted 
circular line) from c. The shell avoids disruption of the elastic waves (bottom). Scale 
bar, 5 cm. Reproduced from figs 3d and 3b of refs 27 and 31 with permission. 


the motion of the fluid is equivalent to the electrical current, and the 
mass density and bulk modulus are related to the capacitance C and 
inductance L. Using this ‘transmission line’ approach, an aluminium plate 
having an array of cavities (acting as acoustic capacitors) connected by 
channels (acting as acoustic inductors) was designed to perform as an 
acoustic cloak for ultrasound waves (Fig. 4b). The aluminium-based cloak 
was placed in a water tank (with an object within the shell) to compare 
ultrasound waves around the object with and without the cloak. When 
the object is surrounded by the metamaterial cloak, the incoming wave is 
restored behind the object with very little distortion, making the cloak 
and the object ‘invisible’ to ultrasound waves in water (ref. 27). This acoustic
cloak—fabricated in the millimetre length scale—operates over a wide 
range of ultrasound frequencies (40-80 kHz). A previous cloak, fabricated 
at the centimetre scale, but designed for surface waves in a liquid material, 
has also provided cloaking effects in the range of 10-Hz frequencies”. 
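
The transmission-line analogy can be made concrete with a short numerical sketch. It assumes the standard lumped mapping in which pressure plays the role of voltage and fluid velocity the role of current, so that mass density acts as an inductance per unit length and the inverse bulk modulus (compressibility) as a capacitance per unit length. The function names and the water parameters are illustrative assumptions, not values from the cited cloak design.

```python
import math


def acoustic_to_circuit(density, bulk_modulus):
    """Map fluid properties to per-unit-length (inductance, capacitance).

    Assumed analogy: pressure <-> voltage, velocity <-> current.
    """
    inductance = density                  # inertia of the fluid
    capacitance = 1.0 / bulk_modulus      # compressibility of the fluid
    return inductance, capacitance


def line_speed(inductance, capacitance):
    """Wave speed on an ideal transmission line, 1/sqrt(LC)."""
    return 1.0 / math.sqrt(inductance * capacitance)


def line_impedance(inductance, capacitance):
    """Characteristic impedance of an ideal transmission line, sqrt(L/C)."""
    return math.sqrt(inductance / capacitance)


if __name__ == "__main__":
    # Assumed round values for water: 1000 kg/m^3 and 2.2 GPa
    L, C = acoustic_to_circuit(1000.0, 2.2e9)
    print("wave speed (m/s):", line_speed(L, C))
    print("impedance (Pa s/m):", line_impedance(L, C))
```

In this picture, choosing cavity and channel dimensions amounts to setting C and L cell by cell, which is how a spatially varying effective density and modulus can be approximated.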
Besides full cloaks in water, an acoustic cloak that works in air, and for
audible frequencies 1-4 kHz, has been designed, fabricated and charac- 
terized using acoustic metamaterials*’. This cloak, however, is based on a 
different technique known as ‘ground cloaking’, because the object to be 
made ‘invisible’ is located on a reflecting surface. The object on the 
surface is covered with a coating whose metamaterial properties are given 
by coordinate transformation acoustic theory. The function of the coat- 
ing is to make the object invisible. The metamaterial coating is made of a 
stack of planar perforated plastic plates, in which the size (millimetres) 
and shapes of the perforations are used to design the acoustic properties 
for cloaking. Using a 3-kHz speaker as a sound source, it is shown that the 
reflected acoustic field from the ground surface alone and that from the 
ground surface and the coated object are essentially identical, thus demon- 
strating a new way of cloaking objects from sound waves in air”. 

A cloaking shell for elastic waves (ref. 31), fabricated using a structured
polymer plate, demonstrates that theory and experiments on acoustic 
cloaking can be expanded from fluids to solid materials (Fig. 4c)?***. 
The solid cloaking shell consists of 20 concentric rings, each a tailored 
composite of polyvinyl chloride (PVC) and polydimethylsiloxane (PDMS) 
that allowed the Young’s modulus within the shell to be specified and 
cloaking effects to be obtained. To show the cloak in operation, elastic 
sound waves were guided through the shell (and around the object to be 
concealed) and their movement was recorded using a stroboscopic 
imaging technique. For frequencies of 200-400 Hz, the elastic waves can 
pass through the cloaking shell and propagate as they would have done 
without an obstacle—thus demonstrating acoustic cloaking in solid 
materials (Fig. 4d). This cloak is based on a theoretical proposal that 
modifies the technique of coordinate transformation of Maxwell’s equa- 
tions to create cloaking shells for elastic flexural waves in solids”. Recently, 
a theoretical approach for the control of coupled pressure and shear 
waves in elastic solids has also been introduced’’. 

The remarkable progress made in recent years in acoustic cloaking is 
evident from the large number of devices designed to hide objects from 
sound waves, which include air, water and solid-material platforms. In 
analogy with their electromagnetic counterpart, acoustic cloaking and 
metamaterials open up new ways to manipulate sound. Such metama- 
terials can be used to soundproof studios and buildings, design concert 
halls, control environmental noise, make ships invisible to sonar waves, 
or even design shields against seismic waves. Metamaterial concepts and 
devices bring together different research areas, including cosmology, 
physics, materials science and civil engineering. 


Enhancing the interaction between sound and light 


Interaction between phonons and photons is well known in studies of 
Raman spectroscopy and Brillouin scattering processes. Recently, a mecha- 
nism involving the simultaneous localization of mechanical and optical 
waves by using acoustic and optical cavities has been introduced greatly 
to enhance sound and light interactions. This raises the prospect of a new 
class of acousto-optical crystal that can integrate the combined manage- 
ment of phonons and photons!****?. 

A 1D crystal showing simultaneous localization of sound and light by 
resonant cavities was introduced by constructing an acoustic cavity 
within another optical cavity’*** (Fig. 5a). The optical cavity is made 
of AlₓGa₁₋ₓAs and AlAs layers grown by molecular beam epitaxy,
within which a spatially localized optical mode (red) can be created by
introducing a defect layer. Similarly, the acoustic cavity is constructed by
GaAs and AlAs layers, with a defect AlₓGa₁₋ₓAs layer that localizes
sound modes (blue). The key feature of the system is that, owing to their 
relatively large wavelengths, photons see the acoustic layered structure 
as a homogeneous defect layer. In particular, when an incident laser 
photon excites the localized mode in the optical cavity, this mode splits 
photoelastically into a localized phonon mode in the acoustic cavity and 
a localized Stokes mode in the optical cavity. These localized phonons 
and photons leave the cavities by tunnelling through the layers. Measure- 
ments of the light spectrum for the outgoing photons clearly show the 
energy shift caused by these novel confined acousto-optical interactions**”*». 
A different approach, in which sound and light are both localized in
the same cavity, considered the use of periodic structures acting as both
phononic and photonic crystals (Fig. 5b; ref. 12). In analogy to phononic
crystals, periodic dielectric structures such as photonic crystals can pre- 
cisely control light propagation**». An effective mechanism with which 
to create non-propagating localized states is the introduction of a defect 
in a periodic structure having a frequency bandgap**”’. Although exten- 
sive work has been focused on separate architectures for phononics and 
photonics, the existence of a PBG does not imply that the structure has a 
photonic bandgap and vice versa. These dual-bandgap (or ‘deaf and 
blind’) structures have been demonstrated by considering an array of 
air cylinders in a silicon matrix (Fig. 5b)’*”’. By taking advantage of 
simultaneous gaps, a defect is introduced to create a single cavity that 
localizes both phonons and photons. In particular, it was indicated that if 
the structure periodicity is a ~ 150 nm, the cavity can co-localize 500-THz
photons and 20-GHz phonons (ref. 12). The simultaneous confinement of

Figure 5 | Enhancing sound-light interaction. a, An array of semiconductor
layers creates an acoustic cavity within an optical cavity. A light beam entering
the structure excites a localized light mode (red) within the optical cavity and a
localized sound mode (blue) within the acoustic cavity. Light and sound escape 
by tunnelling—allowing their interaction to be measured. b, Periodic structures 
having dual photonic-phononic bandgaps can localize both sound and light in 
the same cavity. Transverse electric (TE) photons are localized within a 
structure made of air cylinders in silicon by creating a cavity (the missing 
cylinder) (left). Out-of-plane phonons are localized in the same dual 
photonic—phononic cavity (right). Co-localization can greatly enhance 
acousto-optical interactions. Images taken from fig. 3 of ref. 12. 

c, Co-localization and interaction of photons and phonons in a nanobeam with 
a central defect. Scanning electron microscope image for the nanobeam 
device (left). Optical and mechanical field distributions for co-localized and 
coupled opto-mechanical modes (right). Reproduced from figs 1 and 2a of 
ref. 40 with permission.

phonons and photons in a single cavity is suggested to greatly increase
acousto-optical interactions, leading to a different class of planar
acousto-optical devices that can integrate both light and sound management (ref. 12).
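
A quick estimate shows why a single periodicity of about 150 nm can act on both 500-THz photons and 20-GHz phonons: both wavelengths inside silicon are of the same order as a. The sketch below assumes an approximate refractive index of 3.9 for silicon at this optical frequency and a transverse sound speed of 5,840 m/s. Both material values, and the function names, are illustrative assumptions rather than numbers from the cited work.

```python
SPEED_OF_LIGHT = 299_792_458.0   # m/s
SILICON_INDEX = 3.9              # assumed approximate refractive index near 500 THz
SILICON_SHEAR_SPEED = 5840.0     # assumed transverse sound speed, m/s
PERIODICITY = 150.0e-9           # structure periodicity a from the text, m


def photon_wavelength_in_medium(frequency_hz, refractive_index):
    """Optical wavelength inside a medium of the given refractive index."""
    return SPEED_OF_LIGHT / (refractive_index * frequency_hz)


def phonon_wavelength(frequency_hz, sound_speed):
    """Acoustic wavelength for a given frequency and sound speed."""
    return sound_speed / frequency_hz


if __name__ == "__main__":
    lam_photon = photon_wavelength_in_medium(500.0e12, SILICON_INDEX)
    lam_phonon = phonon_wavelength(20.0e9, SILICON_SHEAR_SPEED)
    print("photon wavelength / a:", lam_photon / PERIODICITY)
    print("phonon wavelength / a:", lam_phonon / PERIODICITY)
```

Both ratios come out between about one and two under these assumptions, consistent with bandgaps arising when the wavelength is comparable to the periodicity.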

More recently, co-localization and interaction of 200-THz photons
and 2-GHz phonons has been demonstrated (ref. 40) by the design, fabrication
and characterization of a 1D silicon nanobeam with rectangular air holes
(Fig. 5c). To co-localize optical and mechanical modes at the centre of the 
structure, a ‘defect’ is introduced by quadratically—and symmetrically— 
decreasing the periodicity from the nanobeam centre. Electric and mech- 
anical field distributions for co-localized optical and mechanical modes 
are shown in Fig. 5c. In this system, the optical modes are excited by using 
a tapered optical fibre that also collects the transmitted light. Mechanical 
modes that can couple to optical modes are classified (according to their 
increasing frequencies) as ‘pinch’, ‘accordion’ and ‘breathing’. The character 
of these modes is described by measuring their radio-frequency spectrum 
and evaluating the phase and amplitude change of the transmitted light. 
Importantly, the demonstration of phonon-photon coupling in a planar 
‘optomechanical’ crystal allows for new methods of probing, stimulating, 
and using mechanical and optical interactions on a chip-scale platform. 

The prospects of manipulating sound and light by co-localization in 
periodic crystals have stimulated significant research in recent years. 
This includes 1D vertical-cavity crystals, 2D and quasi-2D planar 
crystals and 3D crystals, involving exciting phenomena such as 
phoxonic gaps, optomechanical coupling, laser cooling, light modulation, 
quantum motion and phonon laser action. In addition, acousto- 
optical studies on crystal fibres (which have a solid silica central core 
surrounded by a lattice of hollow channels extending axially along its 
length) show interesting phenomena caused by the strong confinement 
of light and acoustic vibrations. These effects include multi-peak 
Brillouin spectra with frequency shifts in the 10-GHz range, as well as 
Raman-like nonlinear forward scattering of light with gigahertz phonons. 
These results show that separate research areas such as those involving 
sound control in phononic crystals and light management in photonic 
crystals can be successfully merged into a field of dual phononic-photonic 
crystal materials. 


Thermal diodes 


The experimental realization of materials and devices that can provide 
unidirectional transmission of heat is more difficult than in the case of 
acoustic diodes. This is because heat is carried by a broad spectrum of 
high-frequency (terahertz) phonons, which are difficult to control. Despite 
this, novel theoretical and experimental concepts have been introduced 
whereby heat can flow only in a particular direction. A theoretical 
thermal diode was proposed by means of two coupled one-dimensional 
lattices with particles interacting with each other through nonlinear 
sinusoidal potentials (Fig. 6a). In the forward direction (T_L > T_R), the 
broad spectrum of one chain overlaps with that of the other chain, allowing 
heat to flow through the structure. In the reverse direction (T_L < T_R), 
however, the vibrational frequencies of the lattices do not match and heat 
cannot flow through. This match-mismatch of the vibrational modes 
gives rise to the unidirectional transmission of thermal energy. Thermal 
rectification effects are achieved by carefully considering different strengths 
in the potentials of the two segments of the lattice. This theoretical 
model is a simplified version of a previously proposed diode involving 
a three-segment 1D nonlinear chain. In recent years, the thermal diode 
concept has been expanded to include the thermal counterparts of the 
components of digital electronic circuits—the thermal transistor and 
thermal logic gates. However, important material challenges must still 
be overcome before their practical realization and use as thermal devices. 

In experiments, thermal diodes were fabricated using carbon nanotubes 
and boron nitride nanotubes. To investigate asymmetric thermal 
transport, amorphous C9H16Pt was deposited non-uniformly along the 
length of the tubes (Fig. 6b). When heat flows from the high-mass 
region (left) to the low-mass region (right), the measured thermal 
conductance is higher. Specifically, for carbon nanotubes and boron nitride 
nanotubes, the conductance in this direction along the nanotube axis is 2% 
and 7% larger, respectively, than that in the reverse direction. To interpret these 
results, it was suggested that boron nitride nanotubes show larger ther- 
mal rectification effects owing to their strong ionic bonds, which favour 
nonlinear effects. This mechanism has subsequently been theoretically 
described by considering a 1D anharmonic lattice with a mass gradient. 
In agreement with experiments, this system provides a larger flow of 
heat when the heavy-mass end is set at a higher temperature. The 
major impact of the electrical diode in electronic circuits suggests that 
highly efficient thermal diodes could have a similar influence in a large 
number of systems that require precise heat control, such as nanoelec- 
tronic devices and energy-saving buildings. 


Thermal metamaterials and heat cloaking 


Another interesting idea is to apply the metamaterial methods that are 
used for sound manipulation to the management of heat conduction. 
This concept has recently been proposed by realizing that mathematical 
techniques to transform Maxwell’s equations can be adapted to the Fourier 
heat equation. Controlling heat conduction has always been a difficult 
scientific challenge. Using newly designed thermal metamaterials, 
however, several striking effects on diffusive heat transfer have been 
implemented, offering a new way to manipulate heat flow (Fig. 7). 
Thermal metamaterials were introduced by considering a cylindrical 
shell that, under a thermal gradient, maintains the inner region at constant 
temperature, that is, the core is ‘invisible’ to the thermal gradient. 
Interestingly, it was also suggested that an inverse thermal flow (or apparent 
negative thermal conductivity) might be achieved by changing the 
geometrical shape of the shell. These innovative theoretical concepts 
have been realized in experiments by constructing a thermal shield made 
of alternating concentric cylinders of latex rubber and a silicone elastomer, 
and placing the shield within an agar-water matrix (Fig. 7a). Under 
a thermal gradient, a constant temperature profile is obtained in the 
shield’s inner region while the temperature profile outside is almost 
undistorted. On the other hand, by intentionally arranging the materials 
along the radial direction, the shell can act as a thermal concentrator, with 
the thermal energy considerably enhanced within the shell and remain- 
ing uniform (Fig. 7b). Even more surprisingly, for a spiral arrangement of 
the layers in the shell, heat rotates within the composite shell in a spiral 
manner such that heat flux (and the thermal gradient) changes its sign 
within the inner region of the shell, creating an apparent negative thermal 
conductivity (Fig. 7c). 

Figure 6 | Thermal diodes. a, Schematic of a thermal diode concept using two 
coupled nonlinear one-dimensional lattices. If T_L < T_R (where ‘L’ means left 
and ‘R’ means right), vibrations in material B concentrate at low frequencies, 
but in material A they concentrate at high frequencies. The frequencies 
do not match, so heat cannot flow efficiently from right to left (reverse 
direction). If T_L > T_R, material A expands its vibrations to low frequencies, 
overlapping with those from material B, and heat is allowed to flow through the 

Thus heat management using metamaterials brings exciting new 
techniques with which to manipulate thermal energy. In analogy to 
optical and acoustical metamaterials, these offer unprecedented control 
of heat conduction. 
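The layered shells described above work because a stack of two isotropic materials conducts heat differently along and across its layers. The following Python sketch uses the standard series/parallel effective-medium estimate to illustrate this. It is not taken from the cited work: the function name is hypothetical and the two conductivity values are illustrative assumptions, not measured data.

```python
def layered_conductivities(k1, k2, fraction1=0.5):
    """Effective conductivities of a two-material laminate (W m^-1 K^-1).

    Returns (k_along, k_across): heat flowing along the layers sees the
    materials in parallel; heat flowing across them sees them in series.
    """
    if k1 <= 0 or k2 <= 0 or not 0.0 < fraction1 < 1.0:
        raise ValueError("conductivities must be positive and 0 < fraction1 < 1")
    fraction2 = 1.0 - fraction1
    k_along = fraction1 * k1 + fraction2 * k2            # parallel: weighted arithmetic mean
    k_across = 1.0 / (fraction1 / k1 + fraction2 / k2)   # series: weighted harmonic mean
    return k_along, k_across


# Illustrative (assumed) conductivities for a poor and a better conductor.
K_LOW, K_HIGH = 0.13, 2.6

k_along, k_across = layered_conductivities(K_LOW, K_HIGH)

# Concentric layers: radial heat flow has to cross every layer.
shield = {"radial": k_across, "azimuthal": k_along}
# Layers alternating around the azimuth: radial heat flow runs along the layers.
concentrator = {"radial": k_along, "azimuthal": k_across}

print(f"shield:       k_r = {shield['radial']:.3f}, k_theta = {shield['azimuthal']:.3f}")
print(f"concentrator: k_r = {concentrator['radial']:.3f}, k_theta = {concentrator['azimuthal']:.3f}")
```

In this simplified picture the concentric arrangement conducts poorly in the radial direction, which steers heat around the core, whereas the azimuthally alternating arrangement conducts well radially, which draws heat towards the core.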


Nanoscale heat transport and thermoelectrics 


An additional important aspect of heat management is the need to reduce 
the amount of heat transport to ultralow levels. Materials with low ther- 
mal conductivities strongly affect the efficiency of thermoelectrics— 
which convert waste heat to electricity and provide an alternative source 
of energy for the future. In recent years, different strategies have been 
employed to enhance the efficiency of thermoelectrics, which is determined 
by the figure of merit ZT = S²σT/κ, where T is the temperature, S is 
the Seebeck coefficient, σ is the electrical conductivity and κ is the thermal 
conductivity. One approach involves the manipulation of the electronic 
properties (using quantum wells and wires) such that S and σ (and thus 
ZT) are increased. Most recent efforts for enhanced thermoelectrics, 
however, have been based on reducing the thermal conductivity through 
nanostructuring. This has opened up promising opportunities to design 
and fabricate highly efficient thermoelectric materials. The remarkable 
progress in thermoelectrics is demonstrated by the large number of 
nanostructures with low thermal conductivities and enhanced 
thermoelectric efficiencies that have been fabricated. 
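As a worked illustration of the figure of merit ZT = S²σT/κ, the Python sketch below shows how halving the thermal conductivity doubles ZT when S and σ are unchanged. The function name and all numerical inputs are hypothetical, chosen only to make the arithmetic easy to follow; they do not describe any material discussed here.

```python
def figure_of_merit(seebeck_v_per_k, sigma_s_per_m, kappa_w_per_m_k, temperature_k):
    """Dimensionless thermoelectric figure of merit ZT = S^2 * sigma * T / kappa."""
    if kappa_w_per_m_k <= 0:
        raise ValueError("thermal conductivity must be positive")
    return seebeck_v_per_k ** 2 * sigma_s_per_m * temperature_k / kappa_w_per_m_k


# Hypothetical inputs: S = 200 microvolts per kelvin, sigma = 1e5 S/m, T = 300 K.
S, SIGMA, T = 200e-6, 1.0e5, 300.0

zt_bulk = figure_of_merit(S, SIGMA, 1.5, T)    # assumed bulk kappa
zt_nano = figure_of_merit(S, SIGMA, 0.75, T)   # same S and sigma, kappa halved

print(f"ZT (bulk kappa)   = {zt_bulk:.2f}")
print(f"ZT (halved kappa) = {zt_nano:.2f}")
```

The sketch assumes that nanostructuring lowers κ without changing S or σ, which is an idealization; in practice interfaces can also scatter charge carriers.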

Initial studies indicated that the Bi2Te3/Sb2Te3 and PbSe/PbTe quantum- 
dot superlattices exhibit enhanced thermoelectric properties. Their 
high ZT values were attributed to the strong reduction in the thermal 
conductivity. Essentially, studies on superlattices demonstrate that the 
efficiency of thermoelectrics can be increased by purposely adding inter- 
faces that act as scattering centres for phonons, reducing their mean free 
paths and therefore the thermal conductivity. In addition to atomic- 
layer-deposition structures, bulk thermoelectrics made of nanostructured 
AgPbSbTe compounds were shown to provide high ZT values. 
In these materials, nanometre-scale quantum dots are produced through 
a thermal processing technique. In analogy to superlattices, the quantum- 
dot interfaces strongly scatter phonons, providing a mechanism for redu- 
cing the thermal conductivity and increasing the figure of merit ZT. 


structure (forward direction). Adapted from ref. 63 with permission. 

b, Schematic and scanning electron microscope image of a boron nitride 
nanotube thermal diode with asymmetric deposition of amorphous C9H16Pt. 
For asymmetric nanotubes, higher thermal conductances are measured when 
heat flows from a high-mass region to a low-mass region. Reproduced from 
fig. 3b of ref. 64 with permission. 

Reducing heat transport to enhance thermoelectric efficiency can 
also be achieved by combining alloys and nanoparticles. Thermal 
conductivity values below the alloy limit were obtained by epitaxially 
embedding ErAs nanoparticles in an In0.53Ga0.47As alloy. The basic idea is that 
the alloy matrix scatters short-wavelength phonons and the ErAs nano- 
particles stop mid-to-long-wavelength phonons, reducing the thermal 
conductivity below the alloy limit and improving thermoelectric 
efficiency. Another way of reducing phonon transport uses bulk 
nanocomposites made of BiSbTe alloys with nanosize grains. In this ‘polycrystal’ 
technique, the material is converted into a nanoparticle dust by ball milling 
and then hot-pressed such that the nanoparticles merge together, creating 
nanosize grains. Measurements show that the enhanced thermoelectric 
properties are the result of a significant reduction in thermal conductivity 
(the reduction is caused by scattering of phonons at the grain boundaries). 

A significant reduction in thermal transport has also been achieved in 
silicon nanowires. Large-area arrays of silicon nanowires grown by 
electrochemical synthesis show a surprising hundred-fold reduction in 
thermal conductivity. The main reason for this reduction is the 
shortening of the phonons’ mean free paths by phonon scattering at the rough 
nanowire surfaces. The development of theoretical models that can describe 
such ultralow thermal conductivities is currently an intense research area 
ranging from atomistic calculations to Boltzmann transport models. 

More recently, the combination of various scattering processes has 
been employed to reduce heat transport greatly in PbTe-based 
thermoelectric materials. First, atomic-scale substitutions (2 mol.% doping 
with Na) are introduced in a bulk PbTe sample to generate a figure of 
merit ZT ~ 1.1 at 775 K. By including endotaxial SrTe nanoparticles 
with 2-10 nm sizes, ZT is increased to about 1.7 at 800 K. Then, by trans- 
forming the sample to a polycrystal material with micrometre grain 
sizes, a ZT value of about 2.2 is achieved at 915 K (ref. 79). The nano- 
composite provides a high ZT owing to the different length scales in the 
structure, which scatter phonons with different wavelengths. Essentially, 
atomic-scale substitutions block short-wavelength phonons, nanoparticles 
block mid-wavelength phonons and micrograins block long-wavelength 
phonons. These combined processes create an ultralow thermal conduc- 
tivity and a highly efficient thermoelectric. The Na-doped PbTe-SrTe 
matrix was synthesized by melting and quenching, while the polycrystal 
was produced by powder processing and spark plasma sintering. 
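One common way to reason about several scattering mechanisms acting together is Matthiessen's rule, in which independent mechanisms add their inverse mean free paths. The Python sketch below illustrates that rule only; it is not the model used in the cited work, and the function name and every mean-free-path value are hypothetical.

```python
def combined_mean_free_path(*paths_nm):
    """Matthiessen's rule: 1/L_total = sum(1/L_i) for independent mechanisms."""
    if not paths_nm or any(p <= 0 for p in paths_nm):
        raise ValueError("provide at least one positive mean free path")
    return 1.0 / sum(1.0 / p for p in paths_nm)


# Hypothetical limits (nm) set by each mechanism for three phonon classes.
# Each class is assumed to be limited mainly by one length scale.
limits_nm = {
    "short-wavelength": {"substitutions": 5.0, "nanoparticles": 500.0, "grains": 5000.0},
    "mid-wavelength": {"substitutions": 500.0, "nanoparticles": 20.0, "grains": 5000.0},
    "long-wavelength": {"substitutions": 5000.0, "nanoparticles": 2000.0, "grains": 800.0},
}

combined = {
    phonon: combined_mean_free_path(*mechanisms.values())
    for phonon, mechanisms in limits_nm.items()
}

for phonon, path in combined.items():
    dominant = min(limits_nm[phonon], key=limits_nm[phonon].get)
    print(f"{phonon}: combined path = {path:.1f} nm, dominated by {dominant}")
```

The combined path is always shorter than the shortest individual one, so each added length scale can only reduce the heat carried by the phonons it scatters most strongly.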
Figure 7 | Thermal metamaterials. a, A thermal shield made of a concentric 
layered structure of latex rubber and silicone elastomer (top). The measured 
temperature profile for horizontal heat flux (white arrow): the shell maintains 
the inner region at constant temperature (bottom). b, A thermal concentrator 
made of azimuthally alternating layers of latex and elastomer (top). The 
temperature profile, where heat flux is increased by 44% in the inner region 
(bottom). c, A thermal inverter made of a spiral arrangement of copper and 
polyurethane (top). The temperature profile showing inversion of the heat 
flux (bottom). Reproduced from figs 2, 3i, 4c and 6a of ref. 69 with permission. 

The idea of reducing heat transport through nanostructuring has 
been the basis for many recent advances in thermoelectric materials. 
The search for an easy-to-craft nanostructure with low thermal conduc- 
tivity is now one of the main challenges in thermoelectrics. Equally 
importantly, nanostructured materials show us that interface scattering 
is a very effective mechanism of stopping the flow of phonons in the 
high-frequency (terahertz) part of the phononic spectrum. 


Thermocrystals and coherent heat effects 


An entirely new way of controlling heat transport, by which heat flow 
can be managed like sound waves, has recently been developed using 
‘thermocrystals’, which are periodic structures made of alloys containing 
nanoparticles (Fig. 8a). The basic idea of thermocrystals is to manipulate 
the heat frequency spectrum such that low-frequency phonons carry a 
considerable part of the heat. In this manner, heat-carrying phonons are 
subject to coherent reflection and transmission at interfaces, and thus many 
applications designed for sound management can be applied to heat flow. 

In thermocrystals, high-frequency phonons are blocked by including 
alloy atoms and nanoparticles (d ~ 1 nm). The alloy atoms scatter 
high-frequency phonons, while the nanoparticles block another set of 
high-frequency phonons. In contrast to thermoelectrics, very high-frequency 
phonons are blocked in thermocrystals. Because high-frequency 
phonons are severely restricted from carrying heat, the proportion of 
heat carried by low-frequency phonons increases (Fig. 8b). To concen- 
trate heat around a specific frequency range—with the purpose of match- 
ing the PBG—the contribution of very low-frequency phonons is also 
reduced by considering a thin-film material with rough surfaces. As a 
result of blocking the highest and lowest frequencies, most heat is thus 
concentrated to a relatively narrow low-frequency band. Specifically, for a 
Si90Ge10 alloy with a 10% filling fraction of d ~ 1 nm Ge nanoparticles, 
the frequency band is 0.1 THz < f < 2 THz, with up to 40% of the heat 
restricted to a narrow ‘hypersonic’ 100-300-GHz range (Fig. 8b). Taking 
advantage of this reduced heat spectrum, phononic crystals with 
periodicities a ~ 10-20 nm are designed such that their PBGs match the 
frequencies of this narrow hypersonic range. These crystals are created 
by introducing air holes in the SiGe alloy thin film containing nanopar- 
ticles (Fig. 8a). In these ‘thermocrystals’ up to 23% of heat is carried by 
phonons with frequencies within the PBG. 

Figure 8 | Thermocrystals. a, Schematic of a thermocrystal made of air 
cylinders on a nanostructured alloy film. b, Red shows the cumulative thermal 
conductivity as a function of frequency for Si90Ge10 nanoparticle films with 
different thicknesses. Yellow shows the frequency bandgaps for films patterned 
with a square arrangement of cylinders (S1) and with the design (S2). The inset 
shows that among the opportunities provided by thermocrystals is the 
possibility of guiding heat by removing a rod of cylinders. The colour scheme 
shows the displacement for a heat-carrying low-frequency phonon propagating 
within the waveguide. 


This matching of the heat frequency spectrum and the PBG demon- 
strates that thermal phonons can be managed through coherent inter- 
ference effects. In the past, phononic crystals have been shown to manage 
sound waves, but these are the first phononic crystals designed to control 
heat-carrying phonons. Thermocrystals lay the foundation for various 
applications such as heat waveguides, thermal lattices, heat imaging, 
thermo-optics, thermal diodes and thermal cloaking. 


Conclusions and perspectives 


Newly developed phononic crystals and metamaterials are both able to 
control phonon transport successfully at low and high frequencies in the 
phononic spectrum, ranging from sound to heat transfer. There is, 
however, an important difference between phonon management based 
on phononic crystals and that based on metamaterials. Whereas in 
phononic crystals the structure periodicity is generally of the same order 
as the wavelengths of the phonons to be controlled, in metamaterials the 
structure length scale can be smaller. This means that if we want to 
control large wavelengths in the phononic spectrum (Fig. 1), such as 
earthquake or tsunami waves, it would be more appropriate to use 
metamaterials, because the required periodicities for phononic crystals 
would be exceedingly large. On the other hand, if we want to manipulate 
the short-wavelength limit, that is, heat flow, then both thermocrystals 
and thermal metamaterials may be appropriate choices depending on 
the structure length scale and intended applications, because heat flow 
can be controlled by using millimetre-scale metamaterials or nanometre- 
scale thermocrystals. 

I have also shown that heat flow can be blocked to ultralow levels 
using a large variety of nanostructured materials. This large reduction of 
the thermal conductivity is a consequence of diffuse interface scattering. 
When the nanostructured material is periodic, however, heat flow can be 
affected by two different physical mechanisms: diffuse interface scatter- 
ing or coherent wave interference. For a periodic nanostructure to con- 
trol heat coherently—and act as a phononic crystal—the wavelengths of 
heat-carrying phonons must be similar to the structure periodicity, 
λ ~ 2a (Bragg reflection). Given that most heat phonon wavelengths 
in standard semiconductors at room temperature are relatively short 
(λ ~ 1-10 nm in silicon), the Bragg condition is essentially not satisfied 
in typical periodic nanostructures, and this prevents heat manipulation 
by PBGs. By increasing heat phonon wavelengths using thermocrystals, 
however, coherent wave-interference effects (and heat wave transport) 
can be achieved in nanostructures that are periodic. 
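The Bragg condition λ ~ 2a can be turned into a rough frequency estimate by assuming a linear acoustic dispersion, f = v/λ. The Python sketch below does this for the periodicities quoted for thermocrystals. The function names are hypothetical and the sound speed is an assumed round number used only for illustration; it is not a value given in this Review.

```python
def bragg_period_m(wavelength_m):
    """Periodicity a that satisfies the Bragg condition lambda ~ 2a."""
    return wavelength_m / 2.0


def bragg_frequency_hz(sound_speed_m_per_s, period_m):
    """Frequency at the Bragg condition, assuming linear dispersion f = v / lambda."""
    if sound_speed_m_per_s <= 0 or period_m <= 0:
        raise ValueError("sound speed and period must be positive")
    return sound_speed_m_per_s / (2.0 * period_m)


ASSUMED_SOUND_SPEED = 6000.0  # m/s, illustrative assumption for a SiGe-like film

# Periodicities of 10 nm and 20 nm, as quoted for thermocrystals.
bragg_ghz = {
    a_nm: bragg_frequency_hz(ASSUMED_SOUND_SPEED, a_nm * 1e-9) / 1e9
    for a_nm in (10.0, 20.0)
}
for a_nm, f_ghz in bragg_ghz.items():
    print(f"a = {a_nm:.0f} nm -> Bragg frequency of about {f_ghz:.0f} GHz")

# Conversely, phonon wavelengths of 1-10 nm would call for sub-5-nm periods.
periods_nm = [bragg_period_m(w_nm * 1e-9) * 1e9 for w_nm in (1.0, 10.0)]
print(f"periods needed for 1-10 nm wavelengths: {periods_nm[0]:.1f}-{periods_nm[1]:.1f} nm")
```

Under these assumptions, periodicities of 10-20 nm correspond to frequencies of a few hundred gigahertz, whereas matching phonon wavelengths of 1-10 nm would require periods of only 0.5-5 nm.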

This Review has shown that the development of new ideas for phonon 
management—combined with the ability to design and fabricate com- 
posite materials from the macroscale to the nanometre scale—has 
fuelled recent progress in sonic and thermal diodes, acoustic and ther- 
mal metamaterials, optomechanical crystals, hypersonic phononic crys- 
tals, thermoelectrics and thermocrystals. These advances have greatly 
increased our ability to manage the phononic spectrum at all relevant 
frequencies: sound, ultrasound, hypersound and heat. The emerging 
field of phonon management has great potential for innovations in 
materials and devices that can precisely manipulate sound and heat. 
Our ability to control electrons and photons has driven major technolo- 
gical revolutions in past decades; perhaps from our new ability to control 
phonons precisely we may expect analogously surprising and exciting 
consequences. 


Dedifferentiation of committed epithelial cells into stem cells in vivo 


Cellular plasticity contributes to the regenerative capacity of plants, invertebrates, teleost fishes and amphibians. In 
vertebrates, differentiated cells are known to revert into replicating progenitors, but these cells do not persist as stable 
stem cells. Here we present evidence that differentiated airway epithelial cells can revert into stable and functional stem 
cells in vivo. After the ablation of airway stem cells, we observed a surprising increase in the proliferation of committed 
secretory cells. Subsequent lineage tracing demonstrated that the luminal secretory cells had dedifferentiated into basal 
stem cells. Dedifferentiated cells were morphologically indistinguishable from stem cells and they functioned as well 
as their endogenous counterparts in repairing epithelial injury. Single secretory cells clonally dedifferentiated into 
multipotent stem cells when they were cultured ex vivo without basal stem cells. By contrast, direct contact with a 
single basal stem cell was sufficient to prevent secretory cell dedifferentiation. In analogy to classical descriptions of 
amphibian nuclear reprogramming, the propensity of committed cells to dedifferentiate is inversely correlated to their 
state of maturity. This capacity of committed cells to dedifferentiate into stem cells may have a more general role in the 
regeneration of many tissues and in multiple disease states, notably cancer. 


The term dedifferentiation was first coined to describe the process in 
which cells of the urodele retinal pigment epithelium lose their 
differentiated properties to replace extirpated lens cells. Although not 
formally demonstrated, the term was used to suggest that differentiated 
epithelial cells reverted to a previous developmental stage before 
their subsequent differentiation into an alternative cell fate. 
Dedifferentiation has since been explored in plants, invertebrates, teleost fishes 
and amphibians. In vertebrates, quiescent differentiated cells can 
revert into replicating progenitor cells to replace lost cells, but 
these progenitor cells do not persist as stable stem cells. Indeed, in 
murine hair follicle regeneration, the immediate differentiated progeny 
of epithelial stem cells are already resistant to dedifferentiation. Conversely, 
the undifferentiated secretory progenitors of the intestine that are the 
immediate progeny of intestinal stem cells are able to dedifferentiate 
into stem cells after injury, mimicking the capacity for dedifferentiation 
of the immediate progeny of Drosophila germline stem cells. 
Recently, using stringent lineage-tracing strategies, airway epithelial 
cells have been shown to be more plastic than recognized previously. In 
a separate study, differentiated secretory cells have been shown to give 
rise to very rare cells (0.34 ± 0.09%) that express basal cell markers 
after severe injury, but the properties of these rare basal-like cells were 
not studied and their functional capacity was not assessed. Here, we 
specifically sought to determine whether stably committed luminal 
cells could dedifferentiate into functional stem cells. 


Secretory cells replicate after stem cell ablation 

Airway basal stem cells have been shown to self-renew and differentiate 
into multiple airway epithelial cell types using genetic lineage 
tracing. Secretory cells are differentiated luminal cells that have 
both secretory and detoxifying functions. Secretory cells can also 
further differentiate into ciliated cells. To test whether secretory cells 
can dedifferentiate into stem cells, we ablated basal stem cells of the 
airway epithelium and simultaneously lineage traced the secretory 
cells of the same mouse (Extended Data Fig. 1). To ablate the airway 
basal stem cells, we generated a CK5-rtTA/tet(O)DTA (hereafter 
referred to as CK5-DTA) mouse, which expresses the active subunit 
of the diphtheria toxin (DTA) in CK5+ (also known as KRT5+) airway 
basal stem cells following doxycycline administration. CK5 expression 
is, however, not restricted to the basal stem cells of the airway 
epithelium; CK5 is expressed in many other epithelial tissues. 
Therefore, the ablation of CK5-expressing cells using systemic 
doxycycline administration through drinking water or by intraperitoneal 
injection is lethal in the adult mouse (data not shown). To circumvent 
this problem, we designed a method for the inhalational delivery of 
doxycycline as a means of inducing DTA transgene expression 
exclusively in the airway epithelium. CK5-DTA mice were exposed 
to either inhaled PBS (i-PBS) or inhaled doxycycline (i-Dox) for two 
or three consecutive days (Fig. 1a, b). Tracheas were isolated 24 h after 
the final dose of i-Dox. Immunofluorescence analysis using the basal 
stem cell markers p63 (also known as TP63), CK5, NGFR and T1α 
(also known as PDPN) demonstrated a specific dose-dependent ablation 
of basal cells and a preservation of normal secretory cell numbers 
(Fig. 1c, d and Extended Data Fig. 2a-c). Approximately 80% (n = 3) 
of the airway basal cells were ablated after three doses of inhaled 
doxycycline (an 81.0 ± 3.4% decrease in the number of CK5+ cells 
and a 78.8 ± 2.4% decrease in the number of p63+ cells) (Extended 
Data Fig. 2d). Immunofluorescence staining of the proliferation marker 
Ki67 in combination with cell-type-specific markers demonstrated 
that residual CK5+ and p63+ basal cells do proliferate, but 
there are actually fewer proliferating basal cells relative 
Figure 1 | Secretory cells proliferate after basal cell ablation. a, Schematic 
representation of the ablation of CK5-expressing basal cells of the trachea. 
Secretory, ciliated and basal stem cells are shown in pink, blue and grey, 
respectively. b, Schematic of the timeline of i-Dox or i-PBS administration and 
tissue collection (C). c, Immunostaining for basal (p63 (green) and CK5 (cyan)) 
and secretory cells (SCGB1A1 (green)) in combination with Ki67 (red) on 
either i-PBS (top)- or i-Dox (bottom)-treated mice (n = 6). White arrows, 
Ki67+ cells. d, Quantification of the percentage of p63+ and SCGB1A1+ cells 
per total number of DAPI (4’,6-diamidino-2-phenylindole)-stained epithelial 
cells in i-PBS or i-Dox groups. n = 3. e, Percentage of p63+ Ki67+ and 
SCGB1A1+ Ki67+ cells relative to total Ki67+ cells in i-PBS- and i-Dox 
(n = 3)-treated CK5-DTA mice. Nuclei, DAPI (blue). *P < 0.05, **P < 0.01 
(two-tailed and paired t-test). n = 3 (three mice per condition). Error bars, 
average ± s.e.m. Scale bars, 20 μm. 


to the total population of replicating cells after ablation (Fig. 1c, e). 
Notably, there was a twofold increase in the numbers of replicating 
secretory cells (SCGB1A1+ Ki67+) in i-Dox-treated animals 
(51.29 ± 3.02%) as compared to i-PBS-treated animals (17.7 ± 2.68%) 
(Fig. 1c, e and Extended Data Fig. 3a). Consistent with the increased 
proliferation of differentiated secretory cells, Ki67 staining was 
specifically increased in the CK8+ (also known as KRT8+) suprabasal layer of 
the airway epithelium (Extended Data Fig. 3b). Thus, secretory cells 
are the predominant cells that replicate after stem cell ablation. 
Interestingly, occasional cells expressed both the basal cell marker 
CK5 and the secretory cell marker SCGB1A1 in i-Dox-treated mice 
(Extended Data Fig. 3c). 


Secretory cells dedifferentiate in vivo 


To lineage-label secretory cells before stem cell ablation, we generated 
quadruple transgenic mice: Scgb1a1-creER/LSL-YFP::CK5-rtTA-tet(O)DTA 
(hereafter referred to as SCGB1A1-YFP/CK5-DTA mice). Administration 
of tamoxifen to induce the CreER-mediated expression of the yellow 
fluorescent protein (YFP) label in secretory cells was followed by three 
doses of i-Dox to induce basal cell ablation (Fig. 2a). Lineage-labelled 
YFP+ secretory cells demonstrated increased rates of proliferation in 
i-Dox-treated animals as compared to i-PBS-treated controls (Extended 
Data Fig. 3d, e). We identified YFP+ secretory-cell-derived cells that 
were morphologically indistinguishable from basal stem cells (Fig. 2b). 
In addition, we found that a subset of lineage-labelled cells expressed a 
suite of basal cell markers including CK5, NGFR, p63 and T1α (Fig. 2b 
and Extended Data Fig. 3f). Quantification revealed that 7.9 ± 2.08% 
of basal cells (585 CK5+ YFP+ cells out of 7,320 total CK5+ cells in 
i-Dox-treated animals, n = 6 mice) expressed a YFP lineage label, 
Figure 2 | Luminal secretory cells dedifferentiate into basal stem cells after 
stem-cell ablation. a, Schematic representation of tamoxifen (Tam) and i-Dox 
administration to SCGB1A1-YFP/CK5-DTA mice followed by tissue 
collection (C). b, Immunostaining for CK5 (red; left) and NGFR (red; right) in 
combination with YFP (green) in i-PBS (top)- or i-Dox (bottom)-treated mice 
(n = 3). White arrows indicate double-positive cells. c, Flow-cytometric 
analysis of dedifferentiated cells. EPCAM* GSIB4* CD24° cells were analysed 
for expression of SSEA-1 and YFP in either i-Dox- or i-PBS-treated mice. n = 3 
(at least two mice each per condition). Scale bars, 20 μm. 


demonstrating that dedifferentiated basal-like cells comprised a substantial 
fraction of the total stem cell pool. Dedifferentiated cells did not 
appear in PBS-treated controls (3 CK5+ YFP+ cells out of 7,558 total 
CK5+ cells counted (0.041 ± 0.028%; n = 6 mice)). Consistently, when 
the entire basal cell population is purified by flow cytometry, the 
YFP-lineage-labelled basal-like cells have lost the secretory cell surface 
marker SSEA-1 (Fig. 2c). Thus, dedifferentiating cells lose markers of 
secretory cell differentiation as they acquire markers of basal stem cells. 
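The pooled cell counts quoted above can be turned into percentages directly, as in the short Python sketch below. The helper name is hypothetical. The pooled values need not coincide exactly with the reported figures, which appear to be means ± s.e.m. (presumably across mice; the per-mouse counts are not given here).

```python
def percent(part, total):
    """Percentage of `part` in `total`."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * part / total


# Pooled counts quoted in the text: CK5+ YFP+ cells out of all CK5+ cells.
pooled_dox = percent(585, 7320)   # i-Dox-treated animals
pooled_pbs = percent(3, 7558)     # PBS-treated controls

print(f"i-Dox: {pooled_dox:.2f}% of CK5+ cells carry the YFP lineage label")
print(f"PBS:   {pooled_pbs:.3f}% of CK5+ cells carry the YFP lineage label")
```

Both pooled percentages fall close to the reported means of 7.9% and 0.041%.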


Secretory cells dedifferentiate ex vivo 


Because our in vivo experiments demonstrated that secretory cells are 
stimulated to dedifferentiate by the ablation of basal stem cells, we 
wondered whether secretory cell dedifferentiation could be induced 
ex vivo when secretory cells were cultured in the absence of basal stem 
cells. We reasoned that such an assay would provide a platform for 
further determining whether the dedifferentiation process is actively 
suppressed by the presence of co-cultured basal stem cells. To assess 
this possibility, we isolated and sorted unlabelled basal stem cells 
and YFP+ secretory cells from SCGB1A1-YFP mice after tamoxifen 
injection (Extended Data Fig. 4). We then performed sphere-forming 
assays with these pure YFP-labelled secretory cells alone or in 
combination with pure sorted GSIB4+ unlabelled basal cells in varying 
proportions. Three possible types of spheres are predicted to form: 
(1) spheres composed exclusively of YFP+ secretory-cell-derived cells; 
(2) spheres composed exclusively of unlabelled basal-cell-derived cells; 
and (3) chimaeric spheres containing YFP+ secretory-cell-derived and 
unlabelled basal-cell-derived cells in the same sphere (Fig. 3a). Using 
our mixed cell populations, we identified 
a seeding density at which 99.14 ± 0.18% (n = 3) of spheres were 
either entirely YFP+ or entirely unlabelled, thus establishing a clonal 
origin of the spheres. Regardless of the relative ratio of input secretory 
and basal stem cells cultured together in a well, the aggregate clonal 
sphere-forming efficiency of each cell type remained constant (Fig. 3b). 
Immunofluorescence analysis demonstrated that in many spheres ori- 
ginating from a single YFP+ secretory cell, some YFP+ cells expressed 
the basal stem cell markers CK5 and p63, indicating that secretory cell 
dedifferentiation occurs ex vivo (even in the presence of nearby purely 
stem-cell-derived spheres; Fig. 3c). Thus, we established an ex vivo 
clonal dedifferentiation assay. Of the total spheres derived clonally 
from YFP+ secretory cells, 88% showed evidence of dedifferentiation, 
and in 70.58% of these spheres, more than 10% of cells expressed the 
stem cell marker CK5. Even when the input proportion of basal cells to 
secretory cells plated in Matrigel was 100:1, no inhibition of dediffer- 
entiation was observed in the spheres that were clonally derived from 
secretory cells (data not shown). This suggested the possibility that 
basal cells in one sphere do not provide a secreted factor to suppress 
the dedifferentiation of secretory cells in another sphere and that direct 
contact is required to suppress secretory cell dedifferentiation. Indeed, 
we observed that in the small fraction (0.86% ± 0.09; n = 3) of chi- 
maerically derived spheres, not a single YFP+-labelled secretory cell 
dedifferentiated and went on to express basal cell markers (Fig. 3c). 
Therefore a single basal stem cell in direct contact with a secretory cell 
prevents dedifferentiation. Of note, these findings cannot entirely 
exclude the possibility that a secreted stem-cell-derived growth factor 
can locally suppress secretory cell dedifferentiation. However, they do 
suggest that an intimate association of basal cells and secretory cells 
would be required to generate sufficient concentrations of a putative 
secreted dedifferentiation-suppressing activity. 

To provide further confirmation that secretory cells can dediffer- 
entiate into basal cells ex vivo, we used mice carrying a doxycycline- 
inducible basal-cell-specific reporter allele (CK5-rtTA/tet(O)H2BGFP) 
and sorted SSEA-1+/H2B-GFP− (in which the human histone H2B 
coding sequence is fused to the GFP coding sequence) secretory cells 
from their tracheas (Extended Data Fig. 5a, b) and performed a sphere- 
forming assay. We verified that no GFP+ cells were present before 
induction by doxycycline. When doxycycline was administered during 
the course of sphere formation, the first appearance of H2B-GFP 
occurred at day 3 of culture, indicating that the secretory cells had been 
converted into basal-like cells. Immunofluorescence analysis revealed 
that the resulting H2B-GFP+ cells also expressed p63 and CK5 (Extended 
Data Fig. 5c, left panels). The same results were obtained when the 
sorted SSEA-1+/H2B-GFP− cells were grown on transwells (Extended 
Data Fig. 5c, right panels). In addition, secretory cells also dedifferen- 
tiated into basal-like cells when we cultured lineage-labelled YFP+ cells 
of SCGB1A1-YFP mice on transwell membranes or as spheres 
(Extended Data Fig. 6a-d). These ex vivo-dedifferentiated cells could 
be serially passaged five times in transwell culture or as spheres and 
retain their expression of basal stem cell markers and their ability to 
self-renew (Extended Data Fig. 6d, e). Similarly, cells that had under- 
gone dedifferentiation in vivo could also be passaged as stable stem cells 
(Extended Data Fig. 6f, g). Thus, dedifferentiated secretory cells can 
stably self-renew. 


Mature secretory cells resist dedifferentiation 


To determine whether all secretory cells have the potential to dedif- 
ferentiate or whether only a subset of secretory cells is endowed with 
this capacity, we attempted to subset this class of epithelial cells. To do 
so, we made use of a transgenic mouse strain that expresses enhanced 
green fluorescent protein (eGFP) specifically in secretory cells of the 


Figure 3 | Secretory cells 
dedifferentiate in the absence of 
basal cells in an ex vivo sphere- 
forming assay. a, Schematic 
representation of lineage labelling, 
sorting and ex vivo sphere-forming 
assay. Schematic representation of 
different types of spheres anticipated 
from the basal and secretory cell- 
mixing assay. b, Quantification of the 
number of spheres that are either 
basal-cell-derived (grey bars) or 
secretory-cell-derived (green bars). 
x axis, the ratios of basal to secretory 
cells seeded. y axis, number of 
spheres formed relative to the 
number of cells seeded. 

c, Immunostaining for p63 (red; top) 
or CK5 (red; bottom) in combination 
with YFP (green). n = 3 (two 
replicates per condition). Error bars, 
average ± s.e.m. Scale bars, 20 μm. 

airway epithelium driven by the promoter of the B1 subunit of the 
vacuolar H(+)-ATPase gene (Atp6v1b1; which we reasoned would be 
associated with mature secretory cells and hereafter refer to as B1- 
eGFP). Co-immunostaining for SSEA-1 and GFP demonstrated 
the existence of three subpopulations of secretory cells: SSEA-1+/ 
GFP−, SSEA-1+/GFP+ and SSEA-1−/GFP+ (Extended Data Fig. 7a). 
Of note, all GFP+ cells are SCGB1A1+ secretory cells and none are 
CK5+ basal cells (Extended Data Fig. 7a). To define the cellular hier- 
archy of these three subsets of cells, we exposed the airway epithelium 
of B1-eGFP mice to sulphur dioxide (SO2) injury. In this injury model, 
SO2 causes the complete sloughing of only the suprabasal differen- 
tiated cells. The remaining basal stem cells are left intact and start 
replicating within 24 h to give rise to a mature epithelium within 
14 days. We found that single-positive SSEA-1+ cells appeared first 
on day 4 and then matured into double-positive SSEA-1+/GFP+ cells 
on day 6 (Extended Data Fig. 7b) before the formation of any fully 
mature single-positive B1-eGFP+ cells evident in the fully mature 
homeostatic epithelium (Extended Data Fig. 7a, bottom panel, arrow- 
heads). Using B1-eGFP mice, we performed sphere-forming assays 
with each of the three subsets of secretory cells (Fig. 4a). Intriguingly, 
all three subsets of cells formed similar large spheres and all these spheres 
contained basal-like cells (Fig. 4b-d). Interestingly, the sphere-forming 
ability of the three populations was inversely proportional to the rela- 
tive maturity of the secretory cell subsets (Fig. 4d). Of note, most of 
the cell aggregates produced from the most mature SSEA-1−/GFP+ 
secretory cell subset occurred as small cell clusters instead of spheres 
(Fig. 4c, e). Furthermore, these cell clusters did not contain CK5- or 
p63-expressing basal stem cells (Fig. 4c). 


Dedifferentiated cells stably persist 


To assess whether dedifferentiated stem-cell-like cells have the ability 
to self-renew and persist in vivo, we generated SCGB1A1-YFP/CK5- 
DTA mice that possessed lineage-labelled dedifferentiated basal-like 
cells as above and these mice were then maintained for 2 months before 
euthanization (Extended Data Fig. 8a). Dedifferentiated YFP+ CK5+ 
cells persisted and continued to represent a sizeable fraction of the stem 
cell pool (9.15% ± 0.41; n = 3). The relative pool size of dedifferen- 
tiated basal-like cells remained stable over the course of 2 months 
(dedifferentiated basal cells represented 8% of the stem cell pool imme- 
diately after dedifferentiation). In addition, triple immunostaining 
for CK5, YFP and Ki67 revealed that YFP+ CK5+ dedifferentiated 
basal-like cells have the same self-renewal rates as do their normal 
YFP− CK5+ basal stem cell counterparts (Extended Data Fig. 8b, c). 
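
As a rough illustration of the comparison of self-renewal rates, the Ki67 percentages quoted in the legend of Extended Data Fig. 8c (10.16% ± 2.57 and 9.25% ± 0.79) can be turned into an approximate t statistic. This is a sketch only: it assumes the ± values are s.e.m. from independent groups, and the function name is ours rather than part of the study's analysis.

```python
import math

def t_from_summary(mean_a, sem_a, mean_b, sem_b):
    """Unpaired t statistic from two means and their standard errors.

    The standard error of the difference is the root sum of squares of
    the two s.e.m. values (groups assumed independent).
    """
    se_diff = math.sqrt(sem_a ** 2 + sem_b ** 2)
    if se_diff == 0:
        raise ValueError("standard errors must not both be zero")
    return (mean_a - mean_b) / se_diff

# Ki67 percentages quoted in the legend of Extended Data Fig. 8c:
# dedifferentiated YFP+ CK5+ cells versus endogenous YFP- CK5+ cells.
t = t_from_summary(10.16, 2.57, 9.25, 0.79)
print(f"t = {t:.2f}")
```

A value of |t| well below 1 is in line with the statement that the two populations proliferate at similar rates, although with about three animals per group such a comparison has limited power.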


Dedifferentiated cells are functional stem cells 


To assess the functional stem cell capacity of dedifferentiated basal- 
like cells, we generated SCGB1A1-YFP/CK5-DTA mice that possessed 
lineage-labelled dedifferentiated basal-like cells and then exposed these 
animals to two forms of physiologic airway injury (Fig. 5a, b). First, a 
toxin-induced airway injury with inhaled SO2 was used to efficiently 
denude suprabasal cells from the airway epithelium, leaving behind a 
single layer of basal cells, some of which were derived from labelled 
secretory cells that had dedifferentiated (marked by YFP) (Extended 
Data Fig. 9a). The epithelium fully regenerated in 14 days as expected 
and immunofluorescence analysis for YFP in combination with CK5 
(basal cell), SCGB1A1 (secretory cell) and FOXJ1 (ciliated cell) revealed 
that YFP+ cells contributed to all three epithelial cell lineages in the 
form of scattered YFP+ patches (Fig. 5c, top panels). To further scru- 
tinize the functional potential of our dedifferentiated basal-like cells, 
we used influenza viral infection as a second physiologic injury model 
(Extended Data Fig. 9b). We again observed that dedifferentiated 
basal-like cells participate in regeneration by giving rise to all three 
epithelial cell types of the airway (Fig. 5c, bottom panels). Similarly, 
sorted dedifferentiated cells that were produced either in vivo or ex vivo 
could be serially passaged in culture and differentiated into mature 
airway epithelium (Fig. 5d–f and Extended Data Fig. 10a–c). 

Figure 4 | Dedifferentiation potential of secretory cells is inversely 
proportional to their maturity. a, Schematic representation of dissociation 
and sorting of three subsets of secretory cells on the basis of SSEA-1 and GFP 
expression. b, Sorting of secretory cell subsets followed by sphere-forming 
assay. Representative images of the predominant type of cell aggregates 
(spheres or cell clusters) from secretory cell subsets. c, Immunostaining for CK5 
(green) and p63 (red) on cell aggregates. d, e, Quantification of the number of 
spheres (d) or cell clusters (e) from secretory cell subsets. n = 3 (two replicates 
per condition). B1, B1-eGFP; S, SSEA-1. Error bars, average ± s.e.m. Scale bars, 
20 μm. 


Furthermore, we asked whether individual dedifferentiated basal- 
like cells are multipotent (that is, able to give rise to ciliated, secretory 
and basal cells) or unipotent (that is, able to give rise to only one of the 
cell types). To address this issue, we cultured individual dedifferen- 
tiated cells and then performed an air-liquid interface culture using 
these clonally derived stem cells. Immunofluorescence analyses for 
basal, secretory and ciliated cell markers revealed that most of the 
clonally derived basal-like cells (11 out of 13 clones) are multipotent 
(Extended Data Fig. 10d). Intriguingly, rare clones give rise to only 
ciliated (1 out of 13 clones) or secretory cells (1 out of 13 clones). The 
potential of single dedifferentiated cells to act as multipotent stem 
cells suggests that some or most basal stem cells in vivo may analo- 
gously serve as multipotent stem cells. However, the rare lineage- 
restricted clones we observe may reflect a heterogeneity in the basal 
cell population in vivo that warrants future scrutiny. 
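
Because only 13 clones were analysed, the clone proportions carry considerable uncertainty. The sketch below converts the reported counts into percentages and attaches an approximate 95% Wilson score interval. The interval is our illustration, not an analysis performed in the study, and the function name is hypothetical.

```python
import math

def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z = 1.96, about 95%)."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half, centre + half

# Clone counts reported in the text (13 clones in total).
clones = {"multipotent": 11, "ciliated only": 1, "secretory only": 1}
total = sum(clones.values())
for label, count in clones.items():
    low, high = wilson_interval(count, total)
    print(f"{label}: {count}/{total} = {100 * count / total:.1f}% "
          f"(approx. 95% interval {100 * low:.0f}-{100 * high:.0f}%)")
```

The wide intervals are one reason to treat the frequency of lineage-restricted clones as provisional.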


Discussion 


Here, using lineage tracing, we have presented evidence that fully 
differentiated cells can revert into stable and functional basal stem 
cells. By contrast, in the hair follicle, the committed progeny of skin 
stem cells do not revert into stem cells when the skin stem cells are 
ablated. However, in the murine intestinal epithelium, undifferenti- 
ated secretory progenitors that are the immediate villin-negative pro- 
geny of stem cells can revert back to a stem cell state after injury. 
Notably, the capacity of fully differentiated villin-positive intestinal 
secretory cells to revert into stem cells was not assessed. Here we show 
that fully committed secretory cells respond to stem cell ablation by 
proliferating and converting into functional epithelial stem cells. Our 
study points to an alternative cellular mechanism through which 
tissues can regenerate after stem cell loss. The existence of multiple 
cellular reservoirs of regenerative capacity may allow a more effective 
reparative response when one or the other cell type is damaged by a 
toxic or infectious insult. 

The ability of basal stem cells to prevent the dedifferentiation of 
secretory cells has many implications for tissue biology in general, as 
stem cells and their progeny can now be seen to reciprocally modulate 
one another to regulate their relative ratios and thereby overall tissue 
architecture. In our example, the prevention of secretory cell dediffer- 
entiation occurs through direct contact, even with a single stem cell. 
This mechanism ensures a precise and local control of epithelial 
architecture. More generally, it suggests that the reciprocal interac- 
tions of stem and committed cells may have been ‘designed’ to ensure 
a robust self-organizing property in diverse tissue types. 

Figure 5 | Dedifferentiated cells are 
functional stem cells both in vivo 
and ex vivo. a, Schematic 
representation of the regeneration of 

Furthermore, the ability of differentiated cells to acquire stem cell 
properties seems to be inversely proportional to the degree of the 
maturity of the differentiated cells. This is analogous to the results 
seen in amphibian nuclear reprogramming in which nuclei from 
more mature cells were less easily reprogrammed than those of their 
immature counterparts. The capacity to segregate secretory cells 
according to their maturity and associated ability to resist dediffer- 
entiation provides an ideal in vivo experimental model for dissecting 
the molecular mechanisms through which cells might lock their iden- 
tity as they mature. Finally, our findings have broad implications 
for cancer biology, as our results point to an underlying physiologic 
form of cell plasticity that could be co-opted in the process of tumor- 
igenesis. Indeed, some lung cancers seem to be able to resist 
chemotherapy by using a lineage conversion into a different tumour 
subtype. 


METHODS SUMMARY 


CK5-rtTA, Scgb1a1-creER, tet(O)DTA and B1-eGFP mice were described 
previously. Corn oil or tamoxifen (2 mg per 20 g body weight) was intraperito- 
neally injected for five consecutive days. Aerosolized PBS or doxycycline was 
administered by inhalation. SO2 injury models have been reported previously. 
Mice were anaesthetized and infected with a sublethal dose of influenza by intra- 
nasal inhalation as described previously. Sphere culture and staining were per- 
formed as described previously. Immunofluorescence and cell sorting were 
performed using standard protocols. 

METHODS 

Mouse models. CK5-rtTA, Scgb1a1-creER, tet(O)DTA and B1-eGFP mice 
were described previously. Rosa26R-eYFP (Gt(Rosa)26Sortm1(EYFP)Cos/J) mice 
(stock no. 006148) and Tg(tet(O)HIST1H2BJ/GFP)47Efu/J mice (stock no. 005104) 
were purchased from The Jackson Laboratory. CK5-rtTA females were crossed 
to tet(O)DTA males to generate a double-transgenic mouse (CK5-DTA) that 
expresses DTA protein upon doxycycline administration. Aerosolized doxycy- 
cline or PBS was administered as described previously. For secretory cell lineage 
tracing after basal cell ablation, we crossed male Scgb1a1-creER/Rosa26R-YFP to 
CK5-DTA female mice to generate quadruple (SCGB1A1-YFP/CK5-DTA)- 
transgenic mice. To label secretory cells, we injected tamoxifen intraperitoneally 
(2 mg per 20 g body weight) for five consecutive days to induce the Cre-mediated 
excision of a stop codon and subsequent expression of YFP. Both male and female 
mice, 6–12 weeks old, were used for experiments. 
Similarly aged mice were used for both control and treated animals. We analysed at least 
three mice per condition in each experiment. The MGH Subcommittee on Research 
Animal Care approved animal protocols in accordance with NIH guidelines. 
SO2 and influenza infection induced injury. SO2 injury models have been 
previously reported. In brief, mice were exposed to 500 p.p.m. of SO2 for 3 h 
40 min. For influenza experiments, influenza A/Puerto Rico/8/34 (PR8) was 
obtained from Charles River Laboratories International. Mice were anaesthetized 
and infected with a sublethal dose of influenza by intranasal inhalation as 
described previously. Mice were killed at day 3 or 14 after infection, and the 
tracheas were removed and placed in 4% paraformaldehyde (PFA). 
Immunofluorescence, microscopy and cell counting. Tracheae were dissected 
and fixed in 4% PFA for 2 h at 4 °C followed by two washes in PBS, and then 
embedded in OCT. Cryosections (6 μm) were permeabilized with 0.1% Triton 
X-100 in PBS, blocked in 1% BSA for 30 min at room temperature (27 °C), incu- 
bated with primary antibodies for 1 h at room temperature, washed, incubated 
with appropriate secondary antibodies diluted in blocking buffer for 1 h at room 
temperature, washed and counterstained with DAPI. Spheres were fixed with 4% 
PFA on day 9 after plating and washed with PBS and stained as described above. 

The following primary antibodies were used: chicken anti-GFP (1:500; GFP- 
1020, Aves Labs); rabbit anti-Ki67 (1:200; ab15580, Abcam); rat anti-Ki67 (1:200; 
14-5698-82, eBioscience); goat anti-SCGB1A1 (1:500; provided by B. Stripp); 
mouse anti-p63 (1:100; sc-56188, Santa Cruz); mouse anti-tubulin, acetylated 
(1:100; T6793, Sigma); mouse anti-FOXJ1 (1:500; 14-9965, eBioscience); rabbit 
anti-NGFR (1:200; ab8875, Abcam); mouse IgM anti-SSEA-1 (1:100; 14-8813-82, 
eBioscience); hamster anti-T1α (1:50, DSHB) and rabbit anti-cytokeratin 5 
(1:1,000; ab53121, Abcam). All secondary antibodies were Alexa Fluor conjugates 
(488, 594 and 647) and used at 1:500 dilution (Life Technologies). 

Images were obtained using an Olympus IX81 inverted microscope (Olympus). 
Confocal images were obtained with a Nikon A1 confocal laser-scanning micro- 
scope with a 40× or 60× oil objective (Nikon CFI Plan APO VC 40× or 60× Oil). 
Cells were manually counted based on immunofluorescence staining of markers 
for each of the respective cell types. Cartilages 1 to 9 were used as reference points in 
all the tracheal samples to count specific cell types on the basis of immunostain- 
ing. Serial sections were stained for the antibodies tested and randomly selected 
slides were used for cell counting. Percentage of basal cells per sphere was calculated 
based on CK5 immunostaining. Spheres with less than 2% CK5+ cells were not 
included in the quantification. 

Cell dissociation and sorting. Airway epithelial cells from trachea were disso- 
ciated using papain solution. Longitudinal halves of the trachea were cut into five 
pieces and incubated in papain dissociation solution at 37 °C 
for 2 h. After incubation, dissociated tissues were passed through a cell strainer 
and pelleted by centrifugation at 500g for 5 min. Cell pellets were dispersed 
and incubated with Ovo-mucoid protease inhibitor (Worthington Biochemical 
Corporation, cat. no. LK003182) to inactivate residual papain activity by incub- 
ating on a rocker at 4 °C for 20 min. Cells were then pelleted and stained with 
EPCAM-PECy7 (1:50; 25-5791-80, eBioscience) or EPCAM-APC (1:50; 17-5791, 
eBioscience), GSIβ4 (Griffonia simplicifolia isolectin β4)-Biotin (L2120, 
Sigma), SSEA-1 eFluor 650NC (1:75, 95-8813-41, eBioscience) and PE anti-mouse 
CD24 (1:100, 553262, BD Pharmingen) for 30 min in 2.5% FBS in PBS on ice. After 
washing, cells were sorted on BD FACS Aria (BD Biosciences) using FACS Diva 
software and analysis was performed using FlowJo (version 10) software. 
Sphere-forming assays and transwell cultures. Cells were cultured and expanded 
in complete SAGM (small airway epithelial cell growth medium; Lonza, CC-3118) 
containing TGF-β/BMP4/WNT antagonist cocktails and 5 μM ROCK inhibitor 
Y-27632 (Selleckbio, S1049). To initiate air-liquid interface cultures, airway basal 
stem cells were dissociated and seeded onto transwell membranes. After confluence, 
medium was removed from the upper chamber. Mucociliary differentiation was per- 
formed with PneumaCult-ALI Medium (StemCell, 05001). Differentiation of air- 
way basal stem cells on an air-liquid interface was followed by directly visualizing 
beating cilia in real time after 10–14 days. For clonal culture assays, dedifferentiated 
basal-like cells (GSIβ4+ YFP+) were sorted and plated on collagen-coated plates 
at low cell density to obtain individual colonies. Individual colonies were isolated 
using microscopic visualization of single colonies followed by trypsin treatment 
and pipette-assisted aspiration. Individual colony-derived cells were maintained 
and expanded separately and used for air-liquid interface culture. Sphere culture 
was performed as described previously. In brief, 50 μl of 1:4 cold Matrigel/ 
MTEC-plus medium was layered on an 8-well chamber slide (Thermo Scientific, 
cat. no. 177402) and incubated at 37 °C for 10 min to solidify the Matrigel. Sorted 
cells were mixed in 2% Matrigel in MTEC-plus and plated on pre-coated 8-well 
chamber slide at a density of 4,000 cells per well. For mixing assays, sorted cells 
were seeded at a density of 6,000 cells per well. In each experiment, three inde- 
pendent wells were used for each condition tested. Medium was changed every 
other day for 9 days. For transwell cultures, cells were suspended in MTEC-plus 
medium and plated on transwell inserts at a cell density of 6,000 cells per well. 
Medium was changed every day for 7 days. For basal-cell reporter assays, cells 
were treated with 1 μg ml−1 doxycycline either at 24 h after plating or just 24 h 
before collection. For serial passaging of spheres, medium from culture wells was 
aspirated, washed with PBS, and then treated with trypsin-EDTA (0.25%) for 
2 min. Trypsin was inactivated and dissociated cells were collected and centrifuged 
at 350g for 3 min at 4 °C. Cells were re-seeded at a 1:20 dilution in Matrigel for the 
next round of sphere culture. 
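
Sphere-forming efficiency, as plotted in Fig. 3b and Extended Data Fig. 6e, is the number of spheres formed relative to the number of cells seeded. The following sketch shows that calculation for wells seeded at 4,000 cells each. The seeding density comes from the protocol above, whereas the sphere counts and names are hypothetical and serve only to illustrate the arithmetic.

```python
def sphere_forming_efficiency(spheres, cells_seeded):
    """Spheres formed per cell seeded, expressed as a percentage."""
    if cells_seeded <= 0:
        raise ValueError("cells_seeded must be positive")
    if spheres < 0:
        raise ValueError("spheres must be non-negative")
    return 100.0 * spheres / cells_seeded

# Seeding density from the protocol; sphere counts below are invented.
CELLS_PER_WELL = 4000
example_counts = [110, 118, 114]  # hypothetical counts from three wells

efficiencies = [sphere_forming_efficiency(c, CELLS_PER_WELL) for c in example_counts]
mean_efficiency = sum(efficiencies) / len(efficiencies)
print(f"mean efficiency: {mean_efficiency:.2f}%")
```

For mixing assays, the same function would be applied separately to each input population, using the number of cells of that type seeded in the well.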

Statistical analysis. The standard error of the mean was calculated from the 
average of at least three independent tracheal samples unless otherwise men- 
tioned. Data were compared among groups using the Student’s t-test (unpaired, 
two-tailed). A P value of less than 0.05 was considered significant. 
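
The quantities named above can be sketched with the Python standard library. The example computes the s.e.m. and the unpaired Student's t statistic with pooled variance. The input values are hypothetical, the function names are ours, and the study's own analysis software is not stated.

```python
import math
import statistics

def sem(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    return statistics.stdev(values) / math.sqrt(len(values))

def students_t(a, b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(a), len(b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two values")
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * statistics.variance(a)
                  + (n_b - 1) * statistics.variance(b)) / df
    if pooled_var == 0:
        raise ValueError("pooled variance is zero; t is undefined")
    se_diff = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (statistics.mean(a) - statistics.mean(b)) / se_diff, df

# Hypothetical percentages from three tracheal samples per group.
control = [1.0, 2.0, 3.0]
treated = [4.0, 5.0, 6.0]
t_stat, df = students_t(control, treated)
print(f"t = {t_stat:.3f}, df = {df}")
```

With three samples per group (4 degrees of freedom), |t| must exceed roughly 2.776 for a two-tailed P value below 0.05. If SciPy is available, `scipy.stats.ttest_ind(control, treated)` returns the statistic together with the two-tailed P value.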

Extended Data Figure 1 | Schematic representation of the dedifferentiation 
of luminal secretory cells into functional basal stem cells. a, Differentiated 
luminal secretory cells are labelled with a YFP lineage tag in a homeostatic 
airway epithelium. Basal stem cells are then ablated using diphtheria toxin. 
In response, lineage-labelled secretory cells dedifferentiate into cells that 
morphologically resemble basal cells and express basal stem cell markers. 
These dedifferentiated basal-like cells respond to physiologically relevant toxic 
and infectious injury and serve as multipotent stem cells during epithelial 
regeneration. The inset depicts the different cell types of the airway epithelium. 
b, Graphical representation of the dedifferentiation potential of differentiated 
basal-like cells. x axis represents the maturity of a secretory cell; y axis 
represents the propensity for dedifferentiation to a basal-like cell. The 
propensity to dedifferentiate is inversely correlated to the maturity of the 
secretory cell. 

Extended Data Figure 2 | Inhaled doxycycline efficiently ablates basal stem 
cells of the airway epithelium. a, Schematic representation of basal-cell- 
specific ablation using i-Dox. b, Co-labelling of p63 (green) and CK5 (red) on 
tracheal sections of CK5-DTA mice that received either i-PBS (top) or i-Dox 
(middle and bottom panels show 2 and 3 doses of i-Dox, respectively). 
c, Co-labelling of NGFR (green) and T1α (red) in i-PBS- or i-Dox-treated mice. 
d, Quantification of the number of p63+ (black bar) and CK5+ (grey bar) basal 
cells from CK5-DTA animals treated with PBS (p63, 1,229 ± 65.45; CK5, 
1,376 ± 25.23), two doses of i-Dox (p63, 690 ± 35.13; CK5, 716 ± 12.44) or 
three doses of i-Dox (p63, 262 ± 29.5; CK5, 255 ± 46.82). y axis represents the 
absolute numbers of basal cells (from three independent tracheal sections). 
Dox (2) and Dox (3) refer to two and three doses of doxycycline inhalation, 
respectively. n = 3 (two mice per condition). Error bars, average ± s.e.m. Scale 
bars, 20 μm. 
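
The extent of basal cell ablation implied by these counts can be expressed as a percentage reduction relative to the PBS control. The sketch below uses the mean counts quoted in the legend; the function name is ours and the calculation ignores the reported errors.

```python
def percent_reduction(baseline, remaining):
    """Percentage of cells lost relative to the control count."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 100.0 * (1 - remaining / baseline)

# Mean basal cell counts quoted in the legend (PBS, two doses, three doses).
counts = {
    "p63": {"PBS": 1229, "Dox (2)": 690, "Dox (3)": 262},
    "CK5": {"PBS": 1376, "Dox (2)": 716, "Dox (3)": 255},
}
for stain, by_condition in counts.items():
    baseline = by_condition["PBS"]
    for condition in ("Dox (2)", "Dox (3)"):
        lost = percent_reduction(baseline, by_condition[condition])
        print(f"{stain} {condition}: {lost:.1f}% fewer cells than PBS")
```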

Extended Data Figure 3 | Secretory cells begin to express proliferation and 
stem cell markers and undergo dedifferentiation following basal cell 
ablation. a, Orthogonal confocal optical sections of SCGB1A1 (green), CK5 
(cyan) and Ki67 (red). XY and XZ planes are shown to demonstrate the 
colocalization of Ki67 and SCGB1A1. b, Quantification of the percentage of 
Ki67+ CK8+ double-positive cells per total Ki67+ cells from i-PBS 
(23.74% ± 6.76)- and i-Dox (63.22% ± 4.14)-treated CK5-DTA mice. 
c, Co-labelling of CK5 and SCGB1A1 in i-Dox-treated CK5-DTA mice. White 
arrows, double-positive cells. d, Immunostaining for YFP (green) and Ki67 
(red) on sections from SCGB1A1-YFP/CK5-DTA mice. White arrows indicate 
YFP+ Ki67+ cells. e, Quantification of the percentage of YFP+ Ki67+ cells per 
total Ki67+ cells in SCGB1A1-YFP/CK5-DTA mice that were treated with 
either i-Dox (31.74% ± 7.15) or i-PBS (9.65% ± 2.12). f, Co-labelling of YFP 
(green) with p63 or T1α (red) on tracheal sections of SCGB1A1-YFP/CK5- 
DTA mice that were either treated with i-PBS (top) or i-Dox (bottom). White 
arrows, double-positive cells. n = 3 (three mice per condition). Error bars, 
average ± s.e.m. Scale bars, 20 μm. 

Extended Data Figure 4 | Dissociation and fluorescence-activated cell 
sorting of airway epithelial cells. a, Schematic representation of tracheal 
epithelial cell dissociation from secretory cell lineage-labelled mice (SCGB1A1- 
CreER/LSL-YFP) after five doses of tamoxifen. Of the total epithelial cells, 
EPCAM+ CD24− cells were gated to remove ciliated cells. Then YFP+ 
secretory cells and GSIβ4+ basal cells were sorted. b, Of the total epithelial cells, 
EPCAM+ CD24− cells were gated to remove ciliated cells (left). Then, YFP+ 
secretory cells were separated from GSIβ4+ basal cells (middle). Sorted YFP+ 
cells were also marked by the secretory cell marker SSEA-1 as expected for a 
pure population of SCGB1A1+ cells (right). 

Extended Data Figure 5 | Sorted secretory cells dedifferentiate into basal- 
like self-renewing stem cells upon ex vivo culture. a, Schematic 
representation of tracheal epithelial cell dissociation from basal cell reporter 
mice (CK5-rtTA/tet(O)H2BGFP) followed by sorting of GFP− SSEA-1+ 
secretory cells. Sorted SSEA-1+ cells were grown as spheres in Matrigel or 
plated on transwell membranes. Doxycycline was administered and cells were 
monitored for the initiation of GFP expression. b, Fluorescence-activated cell 
sorting of SSEA-1+ cells from basal cell reporter mice (CK5-rtTA/ 
tet(O)H2BGFP). Arrows indicate gating windows. EPCAM is a pan-epithelial 
marker used to exclude non-epithelial lineages. Sorted SSEA-1+ secretory cells 
did not express GFP. c, Immunofluorescence staining for p63 (red; top) or CK5 
(red; bottom) in combination with H2B-GFP (green; all panels) on sorted 
secretory cells that were either cultured as Matrigel spheres (left) or on 
transwells (right). Immunofluorescence analysis confirmed that H2B-GFP+ 
cells expressed p63 and CK5, again confirming that secretory cells 
dedifferentiate in culture. n = 3 (two replicates per condition). Scale bars, 
20 μm. 

Extended Data Figure 6 | Sorted lineage-labelled secretory cells undergo 
dedifferentiation ex vivo, express basal stem cell markers, and can be serially 
passaged, as can secretory cells that underwent dedifferentiation in vivo. 
a, Schematic representation of secretory cell labelling, sorting and subsequent 
culturing in Matrigel or on transwell membranes. b, Cell colonies obtained 
from early passage cultures of YFP+ secretory-cell-derived cells on transwell 
membranes. c, Immunostaining for CK5 (red), p63 (magenta) and YFP (green) 
on passage-five basal cell colonies from ex vivo-dedifferentiated cells. 
d, Schematic showing that YFP+ secretory-cell-derived spheres from 
SCGB1A1-CreER/LSL-YFP mice were serially passaged for five generations. 
e, Quantification of the sphere-forming efficiency: P1 (2.86% ± 0.65), P2 
(3.36% ± 0.6), P3 (2.31% ± 0.32), P4 (2.75% ± 0.69) and P5 (2.7% ± 0.94). 
x axis, number of passages. y axis, percentage of spheres formed. f, Schematic 
representation of in vivo dedifferentiation followed by the sorting and culturing 
of YFP+ basal-like cells. g, Immunostaining for CK5 (red), p63 (magenta) and 
YFP (green) on passage-five cell colonies from in vivo-dedifferentiated cells. 
n = 3 (two replicates per condition). Error bars, average ± s.e.m. Scale bars, 
20 μm. 

Extended Data Figure 7 | B1-eGFP transgenic mice express GFP in mature 
subsets of secretory cells. a, Co-labelling of GFP (green) with CK5 or 
SCGB1A1 or SSEA-1 (all in red) on large airways sections derived from 
adult B1-eGFP transgenic mice at homeostasis. White arrows indicate 
SSEA-1+ B1-eGFP, whereas white arrowheads point to cells that are 
B1-eGFP+ SSEA-1− (bottom). b, B1-eGFP trachea were stained for GFP 
(green) and SSEA-1 (red) on day 4 and day 6 after SO2-induced injury. 
n = 3 (two mice per condition per time point). Scale bars, 20 μm. 

Extended Data Figure 8 | Dedifferentiated basal-like stem cells are stable 
and self-renew to the same degree as endogenous basal stem cells. 
a, Schematic representation of the dedifferentiation protocol to assess the 
ability of basal-like stem cells to persist and self-renew for 2 months. 
b, Immunofluorescence staining for YFP (green) in combination with CK5 
(cyan) or Ki67 (red) on sections from SCGB1A1-YFP/CK5-DTA mice 
2 months after basal cell ablation. c, Quantification of the percentage of 
proliferating dedifferentiated CK5+ YFP+ (10.16% ± 2.57) (green bar) and 
wild-type CK5+ YFP− (9.25% ± 0.79) (black bar) stem cells out of the total 
CK5+ stem cell population in the large airways of SCGB1A1-YFP/CK5-DTA 
mice 2 months after basal cell ablation. The white arrow points to a 
proliferating dedifferentiated basal-like stem cell. n = 3 (two mice per 
condition). Error bars, average ± s.e.m. Scale bar, 20 μm. 

Extended Data Figure 9 | SO2- and influenza-induced injury efficiently 
removes the suprabasal cells of the airway epithelium. 
a, Immunofluorescence staining for CK5 (red) and YFP (green) in SCGB1A1- 
YFP/CK5-DTA airway epithelium 24 h after SO2 inhalation. Only a single layer 
of basal cells persists after injury. White arrows, CK5+ YFP+ double-positive 
cells (yellow). b, Immunofluorescence analysis of basal cells (CK5, red) and 
secretory cells (SCGB1A1, green) from uninjured (top), 72 hours post-infection 
(h.p.i.; middle) and 14 days post-infection (d.p.i.; bottom). White arrow, 
secretory cell debris (in green). n = 3 (two replicates per condition). Scale bars, 
20 μm. 
Extended Data Figure 10 | Ex vivo-dedifferentiated cells can be clonally
expanded and give rise to a complete airway epithelium in air-liquid
interface cultures. a, Schematic representation of the ex vivo
dedifferentiation of secretory cells, serial passage and differentiation.
b, Whole-mount immunostaining for YFP (green) in combination with CK5 or
p63 or acetylated (AC)-tubulin (red). c, Co-labelling of YFP (green) with
CK5 or p63 or SCGB1A1 or acetylated-tubulin (red) on differentiated
epithelium derived from serially passaged dedifferentiated basal-like
cells. d, Whole-mount immunofluorescence staining for SCGB1A1 (red) and
acetylated-tubulin (green) on clonally derived epithelium (left).
Co-labelling of YFP (green) with CK5 (white) and CK8 (red; top) or SCGB1A1
and FOXJ1 (bottom) on cross sections of clonally derived epithelium. Scale
bars, 20 μm.
Human immunodeficiency virus type 1 (HIV-1)-specific monoclonal antibodies with extraordinary potency and breadth
have recently been described. In humanized mice, combinations of monoclonal antibodies have been shown to suppress 
viraemia, but the therapeutic potential of these monoclonal antibodies has not yet been evaluated in primates with an 
intact immune system. Here we show that administration of a cocktail of HIV-1-specific monoclonal antibodies, as well 
as the single glycan-dependent monoclonal antibody PGT121, resulted in a rapid and precipitous decline of plasma 
viraemia to undetectable levels in rhesus monkeys chronically infected with the pathogenic simian-human 
immunodeficiency virus SHIV-SF162P3. A single monoclonal antibody infusion afforded up to a 3.1 log decline of 
plasma viral RNA in 7 days and also reduced proviral DNA in peripheral blood, gastrointestinal mucosa and lymph
nodes without the development of viral resistance. Moreover, after monoclonal antibody administration, host 
Gag-specific T-lymphocyte responses showed improved functionality. Virus rebounded in most animals after a 
median of 56 days when serum monoclonal antibody titres had declined to undetectable levels, although, notably, a 
subset of animals maintained long-term virological control in the absence of further monoclonal antibody infusions. 
These data demonstrate a profound therapeutic effect of potent neutralizing HIV-1-specific monoclonal antibodies in 
SHIV-infected rhesus monkeys as well as an impact on host immune responses. Our findings strongly encourage the
investigation of monoclonal antibody therapy for HIV-1 in humans.


A series of broad and potent HIV-1 Env-specific monoclonal antibodies
have recently been isolated and have been shown to target the CD4
binding site, the V1/V2 loops, the V3/V4 loops and N332 glycans,
and the membrane proximal external region. Previous studies in humanized
mice and humans using the earlier generation of HIV-1 Env-specific
monoclonal antibodies suggested that the therapeutic potential
of monoclonal antibodies would be severely limited by the rapid emergence
of viral escape mutations in the context of diverse virus swarms.
However, cocktails of three or five of the new generation of more potent
monoclonal antibodies targeting multiple epitopes have recently been
shown to suppress HIV-1 replication in humanized mice.


Therapeutic efficacy of monoclonal antibody cocktails 


To evaluate the therapeutic potential of broad and potent HIV-1-specific 
monoclonal antibodies in primates with an intact immune system, we 
infused cocktails of monoclonal antibodies as well as single monoclonal 
antibodies into chronically SHIV-infected rhesus monkeys. We focused 
on the N332 glycan-dependent monoclonal antibody PGT121 (ref. 10) 
and the CD4-binding-site-specific monoclonal antibodies 3BNC117 
(ref. 6) and b12 (ref. 20). In the first study, we used eight Indian origin 
adult rhesus monkeys (Macaca mulatta) that did not express the class I 
alleles Mamu-A*01, Mamu-B*08 and Mamu-B*17 and that were
infected intrarectally with the pathogenic virus SHIV-SF162P3 9 months
before the monoclonal antibody infusions. These animals exhibited
chronic set-point viral loads of 3.4-4.9 log RNA copies ml⁻¹ with
clinical disease progression and reduced CD4+ T-lymphocyte counts.
We performed two intravenous monoclonal antibody infusions on day
0 and day 7 with 10 mg kg⁻¹ of each of PGT121, 3BNC117 and b12
(n = 4); or with 30 mg kg⁻¹ of the isotype-matched control monoclonal
antibody DEN3 (n = 1) or saline (n = 3).

After the initial monoclonal antibody infusion, we observed rapid 
and precipitous declines of plasma viral loads to undetectable levels by 
day 7 in 4 of 4 monkeys (Fig. 1a). Virological control persisted for 84 
to 98 days in animals 82-09, 98-09 and 161-09 (Fig. 1b). After viral 
rebound, sequence analysis showed no N332 or other characteristic
escape mutations (Supplementary Information), and viral rebound 
correlated with the decline of serum monoclonal antibody titres to 
undetectable levels (<1 μg ml⁻¹) (Extended Data Fig. 1). Monkey 82-09
exhibited transient viraemia on day 28 (Fig. 1b), which correlated with 
the decline of serum monoclonal antibody titres to undetectable levels 
(Extended Data Fig. 1), but this animal then spontaneously re-controlled 
viral replication until day 98. Monkey 163-09, which had the lowest 
baseline viral load of 3.4 log RNA copies ml⁻¹ before the monoclonal
antibody infusion, exhibited long-term virological control for over 
Figure 1 | Therapeutic efficacy of the triple PGT121, 3BNC117 and b12
monoclonal antibody cocktail. a, b, Plasma viral RNA (log copies ml⁻¹) in
rhesus monkeys chronically infected with SHIV-SF162P3 after infusions of
PGT121, 3BNC117 and b12 on day 0 and day 7 (arrows) for 14 days (a) and
224 days (b). c, d, Plasma viral RNA in rhesus monkeys chronically infected
with SHIV-SF162P3 after infusions with the control monoclonal antibody
DEN3 (87-09) or saline on day 0 and day 7 (arrows) for 14 days (c) and 224 days
(d). e, Proviral DNA (copies per 10⁶ peripheral blood mononuclear cells
(PBMCs)) in the monkeys that received the therapeutic monoclonal antibody
cocktail or controls (inset). f, Log inhibition of viral replication in CD8+
T-lymphocyte-dependent virus suppression assays after monoclonal antibody
infusion. One animal had no recoverable virus at week 2. g, h, PD-1+Ki67+
expression on Gag-specific CD8+ (g) and CD4+ (h) T lymphocytes after
monoclonal antibody infusion.


200 days despite the absence of detectable serum monoclonal antibody 
titres after day 70 (Fig. 1b). Proviral DNA in peripheral blood also
declined rapidly by tenfold in the monkeys that received the monoclonal
antibodies (Fig. 1e). Virological control was not observed in the
monkeys that received DEN3 or saline (Fig. 1c, d), and viral loads on 
day 14 were significantly lower in the monoclonal-antibody-treated 
monkeys than in the controls (P = 0.02, Mann-Whitney U-test). 
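
The group comparison above relies on a two-sided Mann-Whitney U-test. As a minimal sketch of how such a test behaves with groups this small, the following Python snippet computes an exact permutation P value. The viral-load values are hypothetical placeholders, not data from this study, and the function names are assumptions chosen for illustration; the published analysis was run in GraphPad Prism, whose P value calculation may differ (for example, if a normal approximation is used).

```python
from itertools import combinations


def u_statistic(x, y):
    """Mann-Whitney U for x: number of (x_i, y_j) pairs with x_i > y_j; ties count 0.5."""
    return sum((xi > yj) + 0.5 * (xi == yj) for xi in x for yj in y)


def mann_whitney_exact(x, y):
    """Two-sided exact P value obtained by enumerating every relabelling of the pooled data."""
    pooled = list(x) + list(y)
    n, m = len(x), len(y)
    centre = n * m / 2.0                      # expected U under the null hypothesis
    observed = abs(u_statistic(x, y) - centre)
    extreme = total = 0
    for idx in combinations(range(n + m), n):
        chosen = set(idx)
        gx = [pooled[i] for i in idx]
        gy = [pooled[i] for i in range(n + m) if i not in chosen]
        total += 1
        if abs(u_statistic(gx, gy) - centre) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical day-14 log viral loads (NOT study data): four treated, four controls.
treated = [1.7, 1.8, 1.9, 2.0]
controls = [3.9, 4.1, 4.6, 5.0]
print(f"exact two-sided P = {mann_whitney_exact(treated, controls):.4f}")
```

With four animals per group and complete separation between groups, the exact two-sided P value is 2/70 (about 0.029), which is the smallest value this design can produce under an exact test.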

As expected, serum neutralizing antibody titres to the SHIV-SF162P3
challenge virus increased markedly after the monoclonal antibody 
administration and then declined over time (Extended Data Fig. 2). 
After clearance of the monoclonal antibodies, neutralizing antibody 
titres to SHIV-SF162P3 as well as to the related neutralization-sensitive 
virus SHIV-SF162P4 remained slightly higher than baseline titres 
(Extended Data Fig. 2). The magnitude of total Gag-specific CD8+
and CD4+ T-lymphocyte responses was not detectably modulated
after monoclonal antibody administration (Extended Data Fig. 3).
However, by day 28, we observed threefold and fivefold reductions,
respectively, in the percentage of Gag-specific CD8+ and CD4+
T lymphocytes that expressed the exhaustion and activation markers
PD-1 and Ki67 (Fig. 1g, h; P = 0.02). Moreover, CD8+ T lymphocytes
from these animals exhibited increased functional capacity to suppress
virus replication after monoclonal antibody infusion (Fig. 1f; P = 0.03).
These data indicate that monoclonal antibody administration not only 
exerted direct antiviral effects but also improved host immune responses. 

We next investigated the therapeutic efficacy of a single infusion of 
the cocktail of three monoclonal antibodies as well as a combination 
of only two monoclonal antibodies. Fourteen rhesus monkeys infected 
with SHIV-SF162P3 9 months before the monoclonal antibody infusion,
with chronic set-point viral loads of 3.2-5.6 log RNA copies ml⁻¹,
received a single infusion on day 0 with 10 mg kg⁻¹ of each of the
monoclonal antibodies PGT121, 3BNC117 and b12 (n = 5); PGT121
and 3BNC117 (n = 5); or the isotype-matched control monoclonal
antibody DEN3 (n = 4). We observed rapid virological control to
undetectable levels by day 7 in 3 of 5 animals that received the cocktail 
of three monoclonal antibodies and in 5 of 5 animals that received only 
PGT121 and 3BNC117 (Fig. 2a, b). The two animals that failed to 
achieve complete virological suppression had the highest baseline 
plasma viral loads of 5.4 and 5.6 log RNA copies ml⁻¹ before the
monoclonal antibody infusion and exhibited 2.8 and 2.9 log declines, 
respectively, in plasma viraemia before rapid viral rebound on day 21 
(monkeys 4907, 4909; Fig. 2a). No characteristic viral escape mutations 
were detected after viral rebound (Supplementary Information). The 
animals that suppressed viral loads to undetectable levels exhibited up 
to a 3.1 log decline of plasma viral RNA copies ml⁻¹ by day 7 (monkey
4912; Fig. 2b). Viral rebound occurred in most animals between day 28 
and day 84 (Fig. 2a, b) and was associated with declines of serum 
monoclonal antibody titres to undetectable levels (Extended Data 
Fig. 4). The animal with the lowest baseline viral load of 3.2 log RNA 
copies ml⁻¹ exhibited long-term virological control for over 100 days
(monkey 4905; Fig. 2b), and viral loads on day 14 were significantly 
lower in both groups of monoclonal-antibody-treated monkeys than in 
the controls (P = 0.01). One control animal (monkey 5324; Fig. 2c) was 
euthanized for progressive clinical AIDS and opportunistic infections 
during the course of this experiment. PGT121 and 3BNC117 administration
also resulted in threefold and fourfold reductions in the percentage
of PD-1+Ki67+ Gag-specific CD8+ and CD4+ T lymphocytes,
respectively (Fig. 2e, f).
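
The declines above are reported in log₁₀ units. As a small worked conversion (an illustration added here, with the helper name `log_decline_to_fold` chosen for this sketch), a decline of d logs corresponds to a 10^d-fold reduction in plasma viral RNA:

```python
def log_decline_to_fold(log_decline):
    """Convert a log10 decline in viral RNA copies per ml into a fold reduction."""
    return 10.0 ** log_decline


# Log declines quoted in the text for monkeys 4907, 4909 and 4912.
for d in (2.8, 2.9, 3.1):
    print(f"{d} log decline is roughly a {log_decline_to_fold(d):,.0f}-fold reduction")
```

On this scale, the 3.1 log decline seen in monkey 4912 is a reduction of roughly 1,260-fold within 7 days.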

To confirm that viral rebound was not associated with the develop- 
ment of viral resistance to the monoclonal antibodies, we performed a 
second infusion of monoclonal antibodies on day 105 in the monkeys 
that received PGT121 and 3BNC117. Viral re-suppression was observed 
in 4 of 4 animals after the second monoclonal antibody infusion 
(Fig. 2d and Supplementary Information). However, virological con- 
trol appeared less durable and serum monoclonal antibody titres were 
lower after the second monoclonal antibody infusion as compared with 
the first monoclonal antibody infusion (Fig. 2b, d and Extended Data 
Fig. 4), presumably as a result of monkey anti-human antibody responses
that developed after the first monoclonal antibody administration
(Extended Data Fig. 5). Nevertheless, we assessed the impact of the
second monoclonal antibody infusion on proviral DNA in various
tissue compartments (refs 23, 24) and observed a twofold decline in lymph nodes
(P = 0.1; Fig. 2g) and a threefold decline in gastrointestinal mucosa
(P = 0.02; Fig. 2h) 14 days after monoclonal antibody re-administration.
These data indicate that the potent monoclonal antibodies not only
suppressed viraemia but also reduced proviral DNA in tissues without
the generation of viral resistance.


Therapeutic efficacy of single monoclonal antibodies 


Although the cloned SHIV-SF162P3 pseudovirus is highly sensitive to 
3BNC117, we observed that our particular SHIV-SF162P3 challenge 
Figure 2 | Therapeutic efficacy of the double PGT121 and 3BNC117
monoclonal antibody cocktail. a-c, Plasma viral RNA (log copies ml⁻¹) in
rhesus monkeys chronically infected with SHIV-SF162P3 after a single infusion
(arrows) of PGT121, 3BNC117 and b12 (a); PGT121 and 3BNC117 (b); or the
control monoclonal antibody DEN3 (c). d, Plasma viral RNA in monkeys after
a second infusion of PGT121 and 3BNC117 on day 105. e, f, PD-1+Ki67+
expression on Gag-specific CD8+ (e) and CD4+ (f) T lymphocytes in the monkeys
that received PGT121 and 3BNC117. g, h, Proviral DNA (copies per 10⁶ cells) in
lymph nodes (g) and gastrointestinal mucosa (h) before (day 105) and 14 days
after (day 119) the second monoclonal antibody infusion with PGT121 and
3BNC117 in the four animals with detectable viraemia. GMMCs, gastrointestinal
mucosa mononuclear cells; LNMCs, lymph node mononuclear cells.


stock was largely resistant to 3BNC117 (Extended Data Fig. 6), which 
raised the possibility that the observed therapeutic efficacy of the 
monoclonal antibody cocktail in the previous experiment may have 
been due to PGT121 alone. We therefore performed a single infusion 
of 10 mg kg⁻¹ PGT121 alone (n = 4), 3BNC117 alone (n = 4), or the
control monoclonal antibody DEN3 (n = 4) in 12 rhesus monkeys that
were infected with SHIV-SF162P3 9 months before the monoclonal
antibody infusion with chronic set-point viral loads of 3.3-5.4 log RNA
copies ml⁻¹. PGT121 alone resulted in rapid virological control to
undetectable levels by day 7 in 4 of 4 animals, followed by viral rebound 
by day 42 to day 56 in 3 animals that again correlated with declines in 
serum PGT121 titres to undetectable levels (Fig. 3a, c and Extended 
Data Fig. 7; P = 0.02 comparing viral loads on day 14 in PGT121-treated
animals with controls). One animal exhibited
long-term virological control (monkey DN1G; Fig. 3a). PGT121 alone 
also reduced proviral DNA by sixfold in peripheral blood (P = 0.05;
Fig. 3d), fourfold in lymph nodes (P = 0.05; Fig. 3e), and fourfold in
gastrointestinal mucosa (P = 0.1; Fig. 3f) as compared with the DEN3
control on day 14. Moreover, PGT121 alone resulted in threefold and
fivefold reductions in the percentage of PD-1+Ki67+ Gag-specific
CD8+ and CD4+ T lymphocytes, respectively (Fig. 3g, h). In contrast,
3BNC117 alone, to which our SHIV-SF162P3 stock was relatively 
resistant, resulted in only a transient 0.2-1.1 log reduction of plasma 
viral loads (Fig. 3b), and one animal in this group (monkey CW9G) 
was euthanized for progressive clinical AIDS during this experiment. 


Kinetics of virological control 

The kinetics of the initial decline of plasma viraemia after infusion of 
PGT121 or PGT121-containing monoclonal antibody cocktails was a 
median of 0.382 logs per day (inter-quartile range (IQR) 0.338-0.540). 
In contrast, the initial kinetics of decline of plasma viraemia after 
raltegravir-containing combination antiretroviral therapy in HIV-1- 
infected humans was a median of 0.264 logs per day (IQR 0.253-0.284)
Figure 3 | Therapeutic efficacy of the single monoclonal antibodies PGT121
and 3BNC117. a-c, Plasma viral RNA (log copies ml⁻¹) in rhesus monkeys
chronically infected with SHIV-SF162P3 after a single infusion (arrows) of
PGT121 (a), 3BNC117 (b), or the control monoclonal antibody DEN3
(c). d-f, Proviral DNA (copies per 10⁶ cells) in peripheral blood mononuclear
cells (PBMCs) (d), lymph nodes (e) and gastrointestinal mucosa (f) 14 days
after the monoclonal antibody infusion in the animals that received PGT121 or
DEN3. Red bars indicate mean values. Assays for one of the DEN3-treated
controls failed. g, h, PD-1+Ki67+ expression on Gag-specific CD8+ (g) and
CD4+ (h) T lymphocytes in the monkeys that received PGT121.
and after combination antiretroviral therapy in SIV-infected monkeys
was a median of 0.229 logs per day (IQR 0.198-0.265) (J.B.W., unpub- 
lished data) (Extended Data Table 1). Although these reflect different 
models, the rapid virological control after monoclonal antibody admin- 
istration in SHIV-infected rhesus monkeys is striking and consistent 
with a mechanism that involves direct elimination of free virus in plasma 
in addition to virus-infected cells in tissues. The rapid reduction of 
proviral DNA in peripheral blood by day 1 (Fig. 1e) suggests direct
antibody-mediated cytotoxic effects on infected cells, although we
cannot exclude the possibility of an effect of CD4+ T-cell trafficking.
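
To put the decline rates above on a more familiar scale, a rate of r logs per day can be converted to a half-life of log₁₀(2)/r days, assuming simple first-order (exponential) decay over the initial phase. This conversion is an illustration added here and is not part of the reported analysis; the function name is hypothetical.

```python
import math


def half_life_days(logs_per_day):
    """Half-life in days implied by a first-order decline of `logs_per_day` log10 units per day."""
    if logs_per_day <= 0:
        raise ValueError("decline rate must be positive")
    return math.log10(2) / logs_per_day


# Median decline rates quoted in the text (logs per day).
median_rates = {
    "PGT121 or PGT121-containing cocktails (SHIV, monkeys)": 0.382,
    "raltegravir-containing ART (HIV-1, humans)": 0.264,
    "combination ART (SIV, monkeys)": 0.229,
}
for label, rate in median_rates.items():
    print(f"{label}: half-life of about {half_life_days(rate):.2f} days")
```

Under that assumption, the three median rates correspond to half-lives of roughly 0.8, 1.1 and 1.3 days, respectively. As the text notes, these values come from different models and are not directly comparable.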


Summary and implications 


Our studies demonstrate the profound therapeutic efficacy of PGT121 
and PGT121-containing monoclonal antibody cocktails in chronically 
SHIV-SF162P3 infected rhesus monkeys. The therapeutic efficacy in 
the 18 animals that received PGT121 alone or as part of a cocktail 
(Fig. 4a) was dependent on baseline viral loads before monoclonal 
antibody administration. In the 17% of animals (3 of 18) with low 
baseline viral loads <3.5 log RNA copies ml⁻¹, long-term control of
viral replication was observed for the duration of the follow-up period 
(Fig. 4b), which included a substantial period of time after serum 
monoclonal antibody titres had declined to undetectable levels. These 
observations suggest that PGT121 may have converted animals with 
low baseline viraemia into ‘elite controllers’, although additional follow- 
up is required to assess the durability of this effect. These animals still 
had detectable, albeit reduced, proviral DNA in tissues (Figs 2g, h and 
3e, f), and thus virus was not eradicated in these animals. In the 72% of 
animals (13 of 18) with intermediate baseline viral loads 3.5-5.3 log 
RNA copies ml⁻¹, plasma viraemia was rapidly reduced to undetectable
levels within 7 days but then rebounded after a median of 56 days
when serum monoclonal antibody titres declined to undetectable levels
(<1 μg ml⁻¹) (Fig. 4c). In the 11% of animals (2 of 18) with high baseline
viral loads >5.3 log RNA copies ml⁻¹, incomplete control of plasma
viraemia and rapid viral rebound were observed, indicating a therapeutic
ceiling in this model (Fig. 4d). Taken together, baseline viral
loads strongly correlated with the time to viral rebound (P = 0.0002, 
Spearman rank-correlation test; Fig. 4e). 
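
The relationship described above is a rank correlation between baseline viral load and time to viral rebound. The following is a minimal sketch of how a Spearman coefficient is computed, using hypothetical values rather than the study's measurements; the function names are assumptions for illustration, and the sketch returns only the coefficient, not a P value.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman coefficient, computed as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5


# Hypothetical animals (NOT study data): higher baseline load, earlier rebound.
baseline_log_vl = [3.6, 4.0, 4.4, 4.9, 5.2, 5.5]
days_to_rebound = [84, 70, 56, 56, 42, 21]
print(f"Spearman rho = {spearman_rho(baseline_log_vl, days_to_rebound):.3f}")
```

A strongly negative coefficient in such data would indicate that animals with higher baseline viral loads tend to rebound sooner, which is the direction of the relationship reported in Fig. 4e.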

We speculate that the therapeutic impact of these monoclonal antibodies 
reflected not only their direct antiviral activity but also their impact on 
host antiviral immune responses. After monoclonal antibody infusion, 
we observed modest increases in host virus-specific neutralizing antibody 
activity (Extended Data Fig. 2) as well as reduced activation and improved 
functionality of host virus-specific T-lymphocyte responses (Figs 1f-h, 
2e, f and 3g, h). Consistent with these findings, median set-point viral 
loads after viral rebound were 0.61 log lower than median set-point viral 
loads at baseline before monoclonal antibody infusion (P = 0.0005, 
Wilcoxon matched pairs signed-rank test; Fig. 4f), with no evidence 
of reduced viral replicative capacity (Extended Data Fig. 8). Moreover,
3 of 18 monkeys exhibited long-term virological control to undetectable 
levels (Fig. 4b). Defining the precise immunological mechanisms of 
improved virological control after monoclonal and polyclonal antibody 
administration warrants further investigation.
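
The comparison of set-point viral loads before infusion and after rebound is a paired analysis. As a hedged sketch of the Wilcoxon matched pairs signed-rank test, the snippet below computes an exact two-sided P value by enumerating all sign assignments. The paired values are hypothetical and the function name is an assumption; the sketch also assumes there are no zero differences and no tied absolute differences, which a full implementation would need to handle.

```python
from itertools import product


def wilcoxon_signed_rank_exact(before, after):
    """Exact two-sided P value for paired data (assumes no zero or tied |differences|)."""
    diffs = [a - b for b, a in zip(before, after)]
    order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0] * len(diffs)
    for rank, i in enumerate(order, start=1):
        ranks[i] = rank
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)   # sum of ranks of increases
    centre = sum(ranks) / 2.0                                # expected value under the null
    observed = abs(w_plus - centre)
    extreme = total = 0
    for signs in product((0, 1), repeat=len(ranks)):
        w = sum(r for r, s in zip(ranks, signs) if s)
        total += 1
        if abs(w - centre) >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical set-point log viral loads (NOT study data) for six animals.
baseline = [4.2, 4.8, 3.9, 5.1, 4.5, 4.0]
after_rebound = [3.7, 4.0, 3.6, 4.2, 4.1, 3.3]
print(f"exact two-sided P = {wilcoxon_signed_rank_exact(baseline, after_rebound):.5f}")
```

When all n pairs change in the same direction, the exact two-sided P value is 2/2^n, so six concordant pairs give 2/64 (about 0.031); the value shrinks quickly as the number of concordant pairs grows.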

Previous studies in humanized mice and humans showed that the 
earlier generation of neutralizing HIV-1-specific monoclonal antibodies
was unable to control viraemia. More recent studies in
humanized mice have shown that combinations of three or five of 
the new generation of more potent monoclonal antibodies suppressed 
HIV-1 replication, although single monoclonal antibodies still rapidly 
selected for resistance. In contrast to these previous studies, we
observed that a single infusion of PGT121 in rhesus monkeys resulted 
in rapid virological control in both peripheral blood and tissues. It is 
possible that intrinsic differences between HIV-1 replication in mice 
and SHIV replication in monkeys may account for these differences. 
Another key difference is the functional immune system in rhesus 
monkeys as compared with immunosuppressed humanized mice, and 
the profound virological suppression without the development of 
Figure 4 | Therapeutic efficacy of PGT121 or PGT121-containing
monoclonal antibody cocktails in chronically SHIV-infected rhesus
monkeys. a-d, Summary of the therapeutic effect of PGT121 alone or
PGT121-containing monoclonal antibody cocktails in the 18 rhesus monkeys
chronically infected with SHIV-SF162P3 (a), as well as in the subgroups of
animals with baseline viral loads of <3.5 log RNA copies ml⁻¹ (n = 3) (b),
3.5-5.3 log RNA copies ml⁻¹ (n = 13) (c) and >5.3 log RNA copies ml⁻¹
(n = 2) (d). Red lines indicate median log viral loads. e, Correlation of baseline
viral loads with times to viral rebound. P value reflects two-sided Spearman 
rank-correlation test. f, Comparison of set-point viral loads at baseline before 
monoclonal antibody administration and after viral rebound. P value reflects 
two-sided Wilcoxon matched pairs signed-rank test. Red line indicates median 
log viral loads. 


resistance in the present study may reflect the functional host antibody 
effector activity and the intact antiviral cellular immune responses in 
rhesus monkeys. A caveat is that we were unable to quantify the 
intrinsic ability of SHIV-SF162P3 to escape from PGT121 in vivo, 
although previous studies have documented the ability of SHIV-SF162P3
and SHIV-SF162P4 to escape from autologous antibody responses
in other settings. N332A-mutated SHIV-SF162P3 exhibited
only partial escape from PGT121 in vitro but complete escape from 
other N332-dependent monoclonal antibodies including PGT124 and 
PGT128, suggesting a high bar to resistance (Extended Data Fig. 9). 


Concluding remarks 

Our data demonstrate profound therapeutic efficacy of broad and 
potent HIV-1-specific monoclonal antibodies in rhesus monkeys 
chronically infected with SHIV-SF162P3. Our SHIV-SF162P3 stock 
is highly pathogenic, as evidenced by moderate to high chronic set- 
point viral loads and AIDS-related deaths in two animals during the 
course of these experiments (Figs 2c and 3b). Moreover, in a separate 
study, our SHIV-SF162P3 stock led to AIDS-related mortality in 5 of
12 (42%) rhesus monkeys by 1 year after infection, which is comparable
to the reported pathogenicity of SIVmac251 (refs 36, 37) and
greater than the reported mortality of SHIV-AD8 (refs 38, 39). 
Nevertheless, given the differences between SHIV-infected rhesus 
monkeys and HIV-1-infected humans, clinical trials will be required
to establish the therapeutic efficacy of potent neutralizing HIV-1- 
specific monoclonal antibodies in humans. Although multiple mono- 
clonal antibodies targeting different epitopes will probably prove to be 
superior, our data suggest that monotherapy with the most potent and 
broad monoclonal antibodies also warrants clinical evaluation. Moreover, 
the ability of these monoclonal antibodies to reduce proviral DNA in 
tissues suggests that these monoclonal antibodies should also be evalu- 
ated in the context of viral eradication strategies. 


METHODS SUMMARY 


Thirty-four Indian origin, young adult, male and female rhesus monkeys (Macaca 
mulatta) that were infected by the intrarectal route with SHIV-SF162P3 and 
followed 9 months before monoclonal antibody administration were used for these 
studies. Cocktails of monoclonal antibodies (PGT121, 3BNC117, b12) or single 
monoclonal antibodies (PGT121, 3BNC117) were administered once or twice at 
10 mg kg⁻¹ for each monoclonal antibody by the intravenous route. DEN3 was
used as an isotype-matched monoclonal antibody control. Monkeys were bled up 
to three times per week for evaluation of plasma viral loads. Lymph node and 
gastrointestinal mucosal biopsies were processed as single cell suspensions for
proviral DNA assays as previously described. SIV Gag-specific cellular immune
responses were assessed by multiparameter intracellular cytokine staining (ICS)
assays and functional virus suppression assays essentially as described.
HIV-1-specific neutralizing antibody responses were assessed by TZM-bl
luciferase-based neutralization assays. PGT121, 3BNC117 and b12 concentrations were
determined by monoclonal-antibody-specific neutralizing antibody assays. Virus 
sequencing and analyses of breakthrough viruses were performed essentially as 
described. Statistical analyses involved Mann-Whitney U-tests for comparing
independent sets of virological and immunologic parameters, Wilcoxon matched 
pairs signed-rank tests for paired analyses, and Spearman rank-correlation tests for 
correlation analyses. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
METHODS


Animals and study design. Thirty-four Indian-origin, outbred, young adult, 
male and female, specific pathogen-free rhesus monkeys (Macaca mulatta) that 
did not express the class I alleles Mamu-A*01, Mamu-B*08 and Mamu-B*17 associated
with spontaneous virological control were housed at New England Primate
Research Center, Bioqual, or Alpha Genesis. Groups were balanced for susceptible
and resistant TRIM5α alleles. Groups of 4-5 monkeys provided statistical power to
detect large differences in viral loads, and animals were randomly allocated to 
balance baseline viral loads. Animals were infected by the intrarectal route with 
our rhesus-derived SHIV-SF162P3 challenge stock 9 months before monoclonal 
antibody administration. PGT121, b12 and DEN3 monoclonal antibodies were 
generated as previously described and were expressed in Chinese hamster ovary
(CHO-K1) cells and purified by protein A affinity chromatography. 3BNC117 was 
manufactured by Celldex Therapeutics in CHO cells and purified by chromato- 
graphy and sterile filtration. All the monoclonal antibody preparations were endo- 
toxin free. Cocktails of monoclonal antibodies or single monoclonal antibodies 
were administered to monkeys once or twice by the intravenous route at a dose of 
10 mg kg⁻¹ for each monoclonal antibody. Monkeys were bled up to three times
per week for viral loads. Immunological and virological data were generated blinded. 
All animal studies were approved by the appropriate Institutional Animal Care and 
Use Committee (IACUC). 

Cellular immune assays. SIV Gag-specific cellular immune responses were assessed 
by multiparameter intracellular cytokine staining (ICS) assays essentially as described.
12-colour ICS assays were performed with the Aqua green-fluorescent
reactive dye (Invitrogen, L23101) and predetermined titres of monoclonal antibodies 
(Becton Dickinson) against CD3 (SP34; Alexa Fluor 700), CD4 (OKT4; BV711, 
Biolegend), CD8 (SK1; allophycocyanin-cyanine 7 (APC-Cy7)), CD28 (L293; 
BV610), CD95 (DX2; allophycocyanin), CD69 (TP1.55.3; phycoerythrin-Texas 
red (energy-coupled dye; ECD); Beckman Coulter), gamma interferon (IFN-γ)
(B27; phycoerythrin-cyanine 7 (PE-Cy7)), Ki67 (B56; fluorescein isothiocyanate
(FITC)), CCR5 (3A9; phycoerythrin), CCR7 (3D12; Pacific Blue) and PD-1 (EH21.1;
peridinin chlorophyll-A-cyanine 5.5 (PerCP-Cy5.5)). IFN-γ backgrounds were
consistently <0.01% in PBMCs and LNMCs and <0.05% in colorectal biopsy 
specimens. Virus suppression assays were performed by co-culturing purified 
CD8+ T lymphocytes with CD8-depleted PBMCs and monitoring p27 levels for
7-14 days essentially as described.

Neutralizing antibody assays. HIV-1-specific neutralizing antibody responses 
against primary infectious stocks of SHIV-SF162P3 and SHIV-SF162P4 were 
assessed by TZM-bl luciferase-based neutralization assays. PGT121 titres were
determined by X2088_c9 and ZM247v1(Rev-) pseudovirus neutralization, 3BNC117 
titres were determined by 6041.v3.c23 and Q461.ez pseudovirus neutralization, 
and b12 titres were determined by Du422.1.N332A pseudovirus neutralization and
B2.1 ELISA. 

Proviral DNA assay. Proviral DNA was quantified as previously reported.
Lymph node and gastrointestinal mucosal biopsies were processed as single cell
suspensions essentially as previously described. Tissue-specific total cellular DNA
was isolated from 5 × 10⁶ cells using a QIAamp DNA Blood Mini kit (Qiagen). The
absolute quantification of viral DNA in each sample was determined by qPCR using
primers specific to a conserved region of SIVmac239. All samples were directly compared
to a linear virus standard and the simultaneous amplification of a fragment
of the human GAPDH gene. The sensitivity of linear standards was compared against
the 3D8 cell line as a reference standard as described. All PCR assays were
performed with 100 and 200 ng of sample DNA. 

Virus sequencing. Virus sequencing of breakthrough virus was performed essentially
as described. Plasma samples of 1 ml were centrifuged for 30 min at
20,000g and the lowest fraction was subjected to RNA purification (QiaAmp 
MinElute Virus Spin kit; Qiagen). Random hexamers (Roche) or SHIV-SF162P3- 
specific (5’-AAGAGCTCCTCCAGACAGTGAG-3’ or 5'-TAGAGCCCTGGA 
AGCATCCAGGAAGTCAGCCTA-3’) primers were used for cDNA synthesis 
with SuperScript III Reverse Transcriptase (Invitrogen). SHIV envelope sequences 
were amplified by a double-nested PCR approach using the Expand High Fidelity 
PCR System (Roche). First round primers for gp120 were 5’-AAGAGCTCC 
TCCAGACAGTGAG-3’ and 5'-ATGAGTTTTCCAGAGCAACCC-3’ and for 
gp160 were 5’-AAGAGCTCCTCCAGACAGTGAG-3’ and 5'-CAAGCCCTTGT
CTAATCCTCC-3’. Second round primers for gp120 were 5'-GAAAGAGCAGA 
AGACAGTGGC-3’ and 5'-ATTGTCTGGCCTGTACCGTC-3’ and for gp160 
were 5’-GAAAGAGCAGAAGACAGTGGC-3’ and 5'-ATGGAAATAGCTCCA 
CCCATC-3’. After second round PCR, all products were spiked with 0.5 μl Taq
polymerase and incubated for 15 min at 72 °C. Amplicons were excised from a gel 
and purified after cloning into the pCR4-TOPO vector (Invitrogen) and expansion 
in One Shot TOP10 cells at 30 °C. Single colonies were sequenced using M13F/ 
M13R primers as well as primers annealing to the envelope sequence. A consensus 
sequence of each clone was derived using Geneious Pro software (Biomatters), 
and sequence analysis was performed using Geneious Pro and antibody database
software.

Statistical analyses. Analyses of independent virological and immunological data 
were performed by two-tailed Mann-Whitney U-tests. Analyses of paired data 
sets were performed by two-tailed Wilcoxon matched pairs signed-rank tests. 
Correlations were evaluated by Spearman rank-correlation tests. P values less 
than 0.05 were considered significant. Statistical analyses were performed using 
GraphPad Prism. Exponential decay rates of plasma viral loads were calculated 
using standard ordinary least squares regression on log₁₀(viral load) measurements
versus time (days).
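
The decay-rate calculation described above can be sketched as an ordinary least squares fit of log₁₀(viral load) against time. The snippet below is a minimal illustration under that description, not the authors' actual script (the reported analyses used GraphPad Prism); the function name and the example measurements are hypothetical.

```python
import math


def decay_rate_logs_per_day(days, viral_loads):
    """OLS slope of log10(viral load) versus time, returned as a positive decline rate."""
    if len(days) != len(viral_loads) or len(days) < 2:
        raise ValueError("need at least two paired measurements")
    y = [math.log10(v) for v in viral_loads]
    n = len(days)
    mean_t = sum(days) / n
    mean_y = sum(y) / n
    sxy = sum((t - mean_t) * (yi - mean_y) for t, yi in zip(days, y))
    sxx = sum((t - mean_t) ** 2 for t in days)
    return -sxy / sxx   # negate so that a falling viral load gives a positive rate


# Hypothetical plasma viral loads (RNA copies per ml, NOT study data) after infusion.
days = [0, 1, 2, 4, 7]
copies_per_ml = [100000, 42000, 16000, 2700, 170]
print(f"decline = {decay_rate_logs_per_day(days, copies_per_ml):.3f} logs per day")
```

Fitting on the log scale means the slope is directly in logs per day, the unit used for the medians and inter-quartile ranges reported in the kinetics analysis.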
Extended Data Figure 1 | Monoclonal antibody titres after administration
of the triple PGT121, 3BNC117 and b12 monoclonal antibody cocktail.
PGT121, 3BNC117 and b12 titres in the monkeys described in Fig. 1 after
infusion of the triple monoclonal antibody cocktail (arrows).
Extended Data Figure 2 | Neutralizing antibody titres after administration of the triple PGT121, 3BNC117 and b12 monoclonal antibody cocktail. 
SHIV-SF162P3 and SHIV-SF162P4 serum neutralizing antibody ID50 titres in the monkeys described in Fig. 1 after infusion of the triple monoclonal antibody 
cocktail (left) or saline or DEN3 (right). 
Extended Data Figure 3 | Gag-specific T lymphocyte responses after administration of the triple PGT121, 3BNC117 and b12 monoclonal antibody cocktail. 
Gag-specific IFN-γ+ CD8+ (left) and CD4+ (right) T-lymphocyte responses by multiparameter intracellular cytokine staining assays in the monkeys described in 
Fig. 1 after infusion of the triple monoclonal antibody cocktail. 
Extended Data Figure 4 | Monoclonal antibody titres after administration 
of the double PGT121 and 3BNC117 monoclonal antibody cocktail. 
a-c, PGT121, 3BNC117 and b12 titres in the monkeys described in Fig. 2 that 
Extended Data Figure 5 | Monkey anti-human antibody titres after monoclonal antibody administration. 
ELISAs assessing anti-b12, anti-PGT121, anti-3BNC117 and anti-DEN3 antibodies at week 6 and week 10 after monoclonal antibody infusion in the monkeys described in Fig. 2. 
Extended Data Figure 6 | Neutralization sensitivity of SHIV-SF162P3 
pseudovirus and our SHIV-SF162P3 challenge stock. TZM-bl neutralization 
assays of PGT121, 3BNC117 and b12 against the SHIV-SF162P3 pseudovirus 
(top) and against the SHIV-SF162P3 challenge stock (middle, bottom). Note 
sensitivity of our SHIV-SF162P3 challenge stock to PGT121 but relative 
resistance to 3BNC117. 
Extended Data Figure 7 | Monoclonal antibody titres after administration 
of the single monoclonal antibodies PGT121 and 3BNC117. PGT121 and 
3BNC117 titres in the monkeys described in Fig. 3 after infusion of the single 
monoclonal antibodies. 
Extended Data Figure 8 | Virus replicative capacity at baseline and following virus rebound. Numbers of GFP-positive infected GHOST indicator cells per 
well after 3 and 6 days of culture with baseline or rebound SHIV-SF162P3 virus. 
Extended Data Figure 9 | Monoclonal antibody sensitivity to N332A 
mutated SHIV-SF162P3. TZM-bl neutralization assays of PGT121, PGT124, 
Extended Data Table 1 | Viral decay kinetics 

Therapy                        r (logs/day)*, median (IQR)   Median t1/2 (days)‡   Fold decrease in viral load over a 7-day period 
DTG + TNF/FTC (SIV/monkeys)    0.229 (0.198-0.265)"          1.31                  40 
EFV + 2 NRTI (HIV/humans)      0.294 (0.273-0.334)           1.02                  112 
RAL + TDF/FTC (HIV/humans)     0.264 (0.253-0.284)           1.15                  70 
PGT121 mAb (SHIV/monkeys)      0.382 (0.338-0.540)           0.78                  468 

Kinetics of decline of plasma viraemia after antiretroviral therapy in rhesus monkeys and humans and after monoclonal antibody administration in rhesus monkeys. DTG, dolutegravir; EFV, efavirenz; FTC, 
emtricitabine; NRTI, nucleoside reverse transcriptase inhibitor; RAL, raltegravir; TDF, tenofovir disoproxil fumarate; TNF, tenofovir. 

* Computed using viral load measurements at day 0 and day 12 (J.B.W., unpublished). 

† In the case of EFV and RAL therapies, decline rates r correspond to the ‘first phase’ of viral decline. Values reported in ref. 28 have been converted to logs/day (base 10). Note that the specific rate of decline due to 
RAL is slower than that due to EFV. The rapid viral decline in RAL compared to EFV is due to a longer duration in the first phase and a slower transition into the second phase, where viral decline rates are lower. 

‡ The half-life, t1/2 = ln(2)/(r × ln(10)). 
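
The conversions behind the table can be checked with a short sketch. It assumes the half-life footnote is read as t1/2 = ln(2)/(r × ln(10)), the reading that reproduces the tabulated half-lives, and it assumes the 7-day fold decrease is 10 raised to 7r; the function names are illustrative only.

```python
import math


def half_life_days(r):
    """Half-life in days for a decay rate r given in log10 units per day."""
    return math.log(2) / (r * math.log(10))


def fold_decrease(r, days=7):
    """Fold decrease in viral load after `days` at a constant rate r."""
    return 10 ** (r * days)


# Median rate reported for DTG + TNF/FTC in the table above.
dtg_half_life = half_life_days(0.229)   # about 1.31 days
dtg_fold = fold_decrease(0.229)         # about 40-fold over 7 days
```

The other rows give values close to, but not always identical with, the tabulated fold decreases, presumably because the table was computed from unrounded rates.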
RNA catalyses nuclear pre-mRNA splicing 


Sebastian M. Fica, Nicole Tuttle, Thaddeus Novak, Nan-Sheng Li, Jun Lu, Prakash Koodathingal, Qing Dai, 
Jonathan P. Staley & Joseph A. Piccirilli 


In nuclear pre-messenger RNA splicing, introns are excised by the spliceosome, a dynamic machine composed of both 
proteins and small nuclear RNAs (snRNAs). Over thirty years ago, after the discovery of self-splicing group II intron 
RNAs, the snRNAs were proposed to catalyse splicing. However, no definitive evidence for a role of either RNA or protein 
in catalysis by the spliceosome has been reported so far. By using metal rescue strategies in spliceosomes from budding 
yeast, here we show that the U6 snRNA catalyses both of the two splicing reactions by positioning divalent metals that 
stabilize the leaving groups during each reaction. Notably, all of the U6 catalytic metal ligands we identified correspond 
to the ligands observed to position catalytic, divalent metals in crystal structures of a group II intron RNA. These findings 
indicate that group II introns and the spliceosome share common catalytic mechanisms and probably common 
evolutionary origins. Our results demonstrate that RNA mediates catalysis within the spliceosome. 


Nuclear pre-mRNA splicing (Fig. 1a) is a crucial determinant of the 
export, translation, stability and diversity of eukaryotic messages, but 
the spliceosome is the only major cellular machinery required for 
gene expression for which the catalytic components remain undefined. 
Nevertheless, for three decades, there has been widespread speculation 
that nuclear pre-mRNA splicing is catalysed by RNA. 

This speculation arose from the discovery of self-splicing RNAs, the 
identification of snRNA components of the spliceosome, and the finding 
that pre-mRNA introns and group II introns both splice through 
an intermediate having a lariat structure (Fig. 1a). Since then, genetic, 
biochemical and NMR data have shown that the snRNAs share functional 
and structural similarity with the catalytic core of group II introns. 
Similarly to the catalytic domain V of group II introns, U2/U6 helix Ib 
and the intramolecular stem-loop (ISL) of U6 adopt a secondary structure 
having a conserved bulge and AGC triad sensitive to phosphorothioate 
substitutions and important for both steps of splicing 
(Fig. 1b, c). Extending the parallel, a recent crystal structure of a central 
splicing factor, Prp8, revealed domains similar to those found in cofactors 
of group II introns. 

Consistent with a catalytic role for RNA in the spliceosome, in the 
absence of spliceosomal proteins, U2 and U6 can base-pair and fold 
in vitro into a structure that catalyses reactions similar to the two steps 
of pre-mRNA splicing, although the relevance of such protein-free, 
minimal model systems for understanding spliceosomal catalysis 
has been questioned (ref. 20 and compare with ref. 21). 

Whether or not through RNA, the catalytic centre of the spliceosome, 
like that of group II introns, was proposed to catalyse the two 
phosphotransesterifications of splicing by positioning two catalytic 
metals. In this two-metal mechanism, one metal would stabilize 
the nucleophile and the second metal would stabilize the leaving 
group (Fig. 1a). Indeed, in human spliceosomes, as well as group II 
introns, divalent metals stabilize the leaving group during each step 
of splicing. 

Intriguingly, recent crystal structures of a group II intron have 
revealed that domain V utilizes five non-bridging phosphate oxygens 
to coordinate two metals 3.9 Å apart, ideally positioned to effect 
catalysis by the two-metal mechanism. By analogy, the snRNAs have 
been suggested to similarly position metals, consistent with early 
phosphorothioate substitution studies in U6 (refs 12, 13). However, 
only residue U80, situated in the U6 ISL, has been shown to interact 
with a metal, and it has remained unclear whether U80 positions a 
structural or a catalytic metal. Thus, despite work highlighting similarities 
between self-splicing RNA and the snRNAs, there is still no 
direct evidence that the snRNAs mediate splicing catalysis. 

Definitive evidence for a direct role for metals coordinated by the 
RNA in the catalysis of self-splicing group I introns has come from 
metal rescue strategies. These approaches, validated by subsequent 
structural studies, enabled the direct linkage of metal ligands in the 
ribozyme to the splice sites. Application of such strategies in an investigation 
of pre-mRNA splicing has been hindered by proofreading and 
discard mechanisms that compete with catalysis during both steps of 
splicing. Here, by disabling such proofreading, we implemented 
metal rescue strategies in the fully assembled spliceosome and identified 
the direct effectors of splicing catalysis. Our results provide definitive 
evidence that snRNAs interact with the splice sites directly through 
catalytic metals during both chemical steps of splicing, establishing 
that the spliceosome utilizes RNA to catalyse splicing. 


U6 binds metals during both splicing steps 


To identify components that mediate metal catalysis in the spliceosome, 
we used metal rescue approaches (Fig. 2a) to find metal ligands 
that function at the catalytic stage and subsequently test these ligands 
for a direct role in positioning catalytic metals (Fig. 1a). The similarities 
between U6 and the catalytic domain V of group II introns suggest U6 
as the best candidate for providing metal ligands that function at the 
catalytic stage (Fig. 1b, c). Indeed, early sulphur substitution studies 
implicated eight oxygens in the phosphodiester backbone of U6 as 
important for splicing (Fig. 1b). Although informative, these studies 
assayed sulphur substitutions only at the pro-Rp phosphate oxygen 
(phosphorothioate substitutions referred to as PS(Rp), Fig. 2a) and 
subsequent studies have only revealed rescue by thiophilic metals for 
U80-PS(Sp). 

To identify metal ligands, we assayed splicing in Saccharomyces 
cerevisiae extracts reconstituted with U6 bearing individual sulphur 
Figure 1 | Chemistry of pre-mRNA splicing and U2/U6 model showing 
sites sensitive to sulphur substitutions and rescued by thiophilic metal. 
a, Reaction scheme (top) and transition state diagrams (bottom) for the two 
steps of nuclear pre-mRNA splicing. b, c, Models of the U2/U6 structure from 
the spliceosome (b) and domain V of a group II intron (c). Nucleotides where 
sulphur substitutions were shown previously to interfere with splicing are 
coloured red; underlined U6 nucleotides were tested in this study. An asterisk 
indicates those positions where only the pro-Sp oxygen was substituted with 
sulphur; boxed nucleotides were found to provide ligands for metals at the 
catalytic stage (Fig. 2, ref. 26). 


substitutions covering both the Rp and Sp diastereomers at twenty positions, 
on the basis of previous studies on pre-mRNA and group II 
splicing (Fig. 1b, c). To focus on ligand-metal interactions important 
during catalysis, we restricted our analysis to spliceosomes that had 
already undergone catalytic activation by affinity-purifying spliceosomes 
using tagged Prp19p (Extended Data Fig. 1a–c). We then assayed 
for rescue in the absence of ATP and soluble factors, thus ensuring that 
spliceosomes had progressed beyond the final ATP-dependent activation 
step, while at the same time eliminating ATP-dependent proofreading 
mechanisms to enhance the potential for rescue (Extended 
Data Fig. 1d, Supplementary Note 1). 

Five of the twenty tested substitutions conferred strong branching 
defects (Fig. 1b). In addition to the sulphur substitutions G60-PS(Rp), 
U80-PS(Sp), and U80-PS(Rp) (Fig. 2b, h compared with 2d, f, g; first 
lanes) (refs 12–14), the substitutions G78-PS(Sp) and A59-PS(Sp) also 
caused branching defects in extract in Mg2+ (Fig. 2c, e, first lanes; 
Extended Data Fig. 2a). 

In addition to spliceosomes containing U80-PS(Sp) (Fig. 2g; refs 14, 28), 
affinity-purified spliceosomes containing any of the other four sulphur 
substitutions that conferred a branching defect could catalyse branching 
in Cd2+ much more efficiently than in Mg2+ (Fig. 2c–f). Thus, all 
five oxygens sensitive to sulphur substitution bind metals important 
for branching at the catalytic stage. 

To test whether these non-bridging oxygens implicated in branching 
also function in exon ligation, we assembled each of the twenty 
sulphur-substituted spliceosomes on a model substrate, chased spliceosomes 
through branching and assayed for exon ligation after affinity 
purification to remove ATP. Only spliceosomes containing the five 
sulphur substitutions implicated in branching failed to catalyse exon 
ligation in Mg2+ (Fig. 2i, m compared with 2j–l, first lanes; and data 
not shown). In addition to U80-PS(Sp) spliceosomes (Fig. 2l; ref. 28), 
Figure 2 | U6 snRNA positions metals important for both steps of splicing. 
a, Metal rescue strategy, indicating the non-bridging oxygens that were 
substituted with sulphur and the consequence of a substitution for binding of 
Mg2+ or the thiophilic metal Cd2+. b–m, Cd2+ rescues the branching defects 
(b–h) and exon ligation defects (i–m) induced by specific sulphur substitutions. 
Values represent averages; error bars, s.d. (n = 3 for b–h; n = 2 for i–m); no inc, 
no incubation. n, Structure of the domain V metal binding core (PDB 3EOH, 
ref. 26). Residue numbers are shown for the group II intron, with the 
corresponding U6 residues in parentheses. 


A59-PS(Sp) spliceosomes catalysed exon ligation in Cd2+ (Fig. 2j). 
Although G60-PS(Rp), G78-PS(Sp) and U80-PS(Rp) failed to catalyse 
exon ligation even in Cd2+ (Fig. 2k; data not shown), these sulphur 
substitutions exhibited more stringent requirements for metal rescue 
in branching, failing to tolerate the less thiophilic Mn2+, whereas 
U80-PS(Sp) and A59-PS(Sp) spliceosomes did (Extended Data Fig. 2b). 
Therefore the defects of these sulphur substitutions that could not be 
rescued in exon ligation remain consistent with a role for the substituted 
oxygens as metal ligands during exon ligation (Supplementary 
Note 2). Thus, at least two of the oxygens that bind a metal important 
for branching also bind a metal important for exon ligation. 

Overall, our analysis shows that five non-bridging oxygens in U6 
are bona fide metal ligands in the spliceosome (Fig. 2). These ligands 
function after catalytic activation, implicating a structural or direct 
catalytic role for these ligands during the catalytic stage. These ligands 
correspond directly and stereospecifically to the oxygens that coordinate 
two divalent metals in domain V of a group II intron, as revealed 
by X-ray crystallography (Fig. 2n). This parallel suggested that the 
U6 ligands that function at the catalytic stage function analogously to 
domain V metal ligands by directly positioning the catalytic metals 
required for splicing. 


U6 positions catalytic metals during branching 


Experiments with the Tetrahymena thermophila group I intron have 
established biochemical signatures for identifying ligands that position 
catalytic metals: sulphur substitution of ligands to a catalytic metal 
can be rescued more strongly by thiophilic metal or with increased 
specificity for Cd2+ when the substrate ligands to that metal are also 
substituted with sulphur. Consequently, to determine whether the U6 
snRNA metal ligands function catalytically during branching, we first 
identified a substrate sensitive to the identity of catalytic metals required 
for branching. A pre-mRNA substrate bearing a double sulphur substitution 
at both the leaving group and the non-bridging pro-Rp oxygen 
at the 5′ splice site (referred to as 3′S-PS(Rp)) was branched efficiently 
only in the presence of Cd2+ (Fig. 3a), indicating that in yeast, as in 
mammals (compare with ref. 37), divalent metals interact with the 
scissile phosphate (see also Supplementary Notes 3 and 4 and Extended 
Data Figs 3 and 4). We used the 3′S-PS(Rp) substrate as a reporter for 
catalytic metal interactions between the spliceosome and the 5′ splice 
site during branching. 

These experiments revealed several catalytic interactions between 
U6 and the 5′ splice site. Under sensitized conditions (Fig. 3 legend), 
the 3′S-PS(Rp) substrate strongly improved Cd2+-mediated rescue for 
G78-PS(Sp) and U80-PS(Rp) spliceosomes (Fig. 3b). In contrast to the 
3′S-PS(Rp) substrate, the 3′S-PS(Sp) substrate, having the pro-Sp oxygen, 
which is not directly involved in catalysis, substituted with sulphur 
(Extended Data Fig. 3), did not improve Cd2+-mediated rescue 
for G78-PS(Sp) and U80-PS(Rp) (Extended Data Fig. 5a, b). These 
data indicate that the G78 pro-Sp and U80 pro-Rp oxygens interact 
with the 5′ splice site through catalytic metals during branching. 

The 3′S-PS(Rp) substrate also rescued sulphur substitutions that 
could not be rescued on their own. Substitution of both the pro-Sp and 
pro-Rp oxygens at U80 with sulphur (U80-PS2) impaired branching of 
the 3′O-PO substrate, but unlike the individual substitutions, Cd2+ 
could not rescue branching (Fig. 3c). However, the 3′S-PS(Rp) substrate 
allowed robust Cd2+ rescue of branching for U80-PS2 (Fig. 3c), 
providing evidence that both of the U80 non-bridging oxygens bind 
catalytic metals during branching. 

The 3′S-PS(Rp) substrate improved rescue of these sulphur substitutions 
in U6 in a specific manner. The modified substrate did not 
improve branching for U6 variants that compromised branching due 
to base mutations (Extended Data Fig. 5c, d, Supplementary Note 5) 
and did not improve Cd2+ rescue for G60-PS(Rp) (Fig. 3b). Thus the 
enhanced rescue of G78-PS(Sp), U80-PS(Rp) and U80-PS2 spliceosomes 
conferred specifically by the 3′S-PS(Rp) substrate bears the 
hallmark of ligands linked by common metals and indicates that 
the corresponding oxygens in U6 position catalytic metals during 
branching. 

An implication of this conclusion is that a sulphur substitution in 
U6 might also reduce the Cd2+ concentration required for rescue of a 
substrate with sulphur substitutions at the 5′ splice site, by enhancing 
Cd2+ occupancy of a common metal binding site. Indeed, with the 
3′S-PS(Rp) substrate, substitution of the pro-Sp oxygen of U80 with 
sulphur decreased the Cd2+ titration midpoint for rescue by sixfold 
compared to wild-type U6 (Fig. 3d, Supplementary Note 6), and at a 
limiting Cd2+ concentration (10 µM), increased the rate of branching 
sevenfold compared to wild-type U6 (Extended Data Fig. 6a, b), demonstrating 
that U80-PS(Sp) increased binding of Cd2+ at the catalytic core. 
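
The effect of a shifted titration midpoint can be illustrated with the standard Hill equation, which is the usual form behind the Hill fits mentioned in the Fig. 3 legend. This is a sketch only: the midpoints of 60 µM and 10 µM are hypothetical values chosen to differ sixfold, the Hill coefficient of 1 is an assumption, and the exact parametrization used for the fits is not stated in the text.

```python
def hill(conc, midpoint, n=1.0):
    """Fraction of maximal rescue at metal concentration `conc`.

    `midpoint` is the concentration giving half-maximal rescue and
    `n` is the Hill coefficient; both must use the same units as `conc`.
    """
    return conc ** n / (midpoint ** n + conc ** n)


# Hypothetical midpoints (µM) that differ sixfold, for illustration only.
wild_type_midpoint = 60.0
substituted_midpoint = 10.0

limiting_cd = 10.0  # µM, the limiting Cd2+ concentration named in the text
wild_type_fraction = hill(limiting_cd, wild_type_midpoint)
substituted_fraction = hill(limiting_cd, substituted_midpoint)
```

Under these assumptions, lowering the midpoint raises the fraction of rescued complexes at a limiting metal concentration, which is the qualitative behaviour described above; the sketch does not attempt to reproduce the measured sevenfold rate increase.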

Our analysis has also revealed that branching requires two distinct 
catalytic metals (compare with ref. 22). With the 3′S-PS(Rp) substrate, 
Cd2+ titrations indicated that rescue required binding of Cd2+ to two 
Figure 3 | U6 snRNA positions catalytic metals during branching. a, The 
3′S-PS(Rp) substrate is sensitive to catalytic metal interactions. b, c, The 
3′S-PS(Rp) substrate improves rescue of G78-PS(Sp), U80-PS(Rp) and U80-PS2 
spliceosomes. d, U80-PS(Sp) reduces the Cd2+ concentration required for 
branching of the 3′S-PS(Rp) substrate. e, The 3′S-PS(Rp) substrate is rescued by 
Cd2+ bound at two distinct sites. In d and e, Hill fits are shown (solid curves); in 
e, titration midpoints are indicated. In b and e, a lower pH (pH 7.0) was used 
compared to Fig. 2 (pH 8.5) to sensitize the system for improved rescue. Values 
represent averages; error bars, s.d. (n = 3); no inc, no incubation. 
sites, when Cd2+ was the only thiophilic metal present (Fig. 3e). In 
contrast, when Mn2+ was also present, rescue still required binding of 
Cd2+ but to only one site (Fig. 3e; Extended Data Fig. 7a–c), demonstrating 
that one metal site could bind Mn2+ whereas the other 
site could not. Given that Cd2+ is expected to bind a site containing 
several sulphur ligands, whereas Mn2+ is not, these and additional 
data indicate that one metal interacts with both sulphur atoms of the 
3′S-PS(Rp) substrate (referred to as M1) (Extended Data Fig. 7e, f), 
whereas the second metal only interacts with the non-bridging Rp 
sulphur (M2). Further evidence indicates that the U80 pro-Rp and 
G78 pro-Sp oxygens also interact with the M1 site, whereas the U80 
pro-Sp oxygen interacts with the second metal site (M2) (Extended 
Data Figs 7 and 8, Supplementary Note 7). Thus, at least three of the 
five identified U6 ligands coordinate two distinct catalytic metals that 
interact with the scissile phosphate of the 5′ splice site during branching 
(Supplementary Note 8, Extended Data Figs 5e–g and 6e). 


U6 positions a catalytic metal during exon ligation 


To determine whether the U6 metal ligands function during exon 
ligation by directly binding catalytic metals, we tested for biochemical 
signatures, as above, that would link the U6 ligands to the catalytic 
metal that interacts with the 3′ splice site during exon ligation. 

First, we introduced a sulphur substitution at the 3′ splice site leaving 
group to sensitize splicing of a model substrate to binding of the 
catalytic metal that interacts with this group. Additionally, we introduced 
a mutation of the 3′ splice site consensus sequence (UAG to 
UAc) to stall spliceosomes before exon ligation regardless of whether 
the 3′-oxygen leaving group at the 3′ splice site was substituted with 
sulphur or oxygen (3′S or 3′O). Exon ligation of this substrate can 
proceed only in the absence of ATP due to proofreading. Although 
exon ligation of affinity-purified UAc-3′O spliceosomes proceeded in 
the absence of ATP, the 3′S substitution compromised exon ligation 
when only Mg2+ was present (Fig. 4a). When Mn2+ or Cd2+ was present, 
spliceosomes assembled on the UAc-3′S substrate catalysed exon ligation 
fourfold to sixfold more efficiently (Fig. 4a), indicating that a 
divalent metal stabilizes the leaving group in yeast, as in mammals 
(see also Supplementary Note 9 and Extended Data Fig. 9a–f). Because 
the UAc-3′S substrate specifically required thiophilic, catalytic metals, 
we used it as a reporter for catalytic metal interactions between the 
spliceosome and the 3′ splice site during exon ligation (Supplementary 
Note 10). 

After assembling spliceosomes containing U6 sulphur substitutions 
on this sulphur-substituted substrate, we probed for exon ligation 
at the catalytic stage (that is, in the absence of ATP and soluble 
factors). As expected from splicing assays of the wild-type UAG-3′O 
substrate (Fig. 2j, l), exon ligation of the mutated UAc-3′O substrate 
in Mg2+ occurred with exceedingly low efficiency with spliceosomes 
containing U6 A59-PS(Sp) or U80-PS(Sp) (Fig. 4b). However, in contrast 
to exon ligation assays of the wild-type UAG-3′O substrate 
(Fig. 2j, l), exon ligation of the mutated UAc-3′O substrate by these 
sulphur-substituted spliceosomes was not rescued by Cd2+ (Fig. 4b), 
indicating that Cd2+ could no longer rescue the U6 sulphur substitutions 
in the context of the substrate base mutation. Nevertheless, when 
the substrate also contained sulphur in the leaving group position 
(UAc-3′S), Cd2+, but not Mg2+, strongly stimulated exon ligation by 
the sulphur-substituted spliceosomes (Fig. 4b, Extended Data Fig. 9g). 
This stronger rescue by Cd2+ of a sulphur-substituted ligand in the 
presence of a second sulphur-substituted ligand constitutes a signature 
for a functional and physical link between two ligands that bind the 
same metal. These data therefore establish that the U6 A59 pro-Sp 
and U80 pro-Sp oxygens interact with the catalytic metal that stabilizes 
the leaving group during exon ligation. 

The signature was specific to the U6 A59-PS(Sp) or U80-PS(Sp) 
spliceosomes and was not observed for other spliceosomes that were 
compromised for exon ligation. Unlike the U6 sulphur-substituted 
spliceosomes assembled on the UAc-3′S substrate, the exon ligation 
Figure 4 | U6 snRNA positions a catalytic metal during exon ligation. a, The 
UAc-3′S substrate is sensitive to catalytic metal interactions. b, c, The UAc-3′S 
substrate specifically improves rescue of U6 A59-PS(Sp) and U80-PS(Sp) 
spliceosomes. Values represent averages; error bars, s.d. (n = 3); no inc, no 
incubation. 


defect of U6 A51c spliceosomes was not substantially suppressed in 
Cd2+ by the UAc-3′S substrate (Fig. 4c). Furthermore, the exon ligation 
defect of G60-PS(Rp) was not suppressed by the UAc-3′S substrate 
in Cd2+ (Fig. 4c; Extended Data Fig. 9h; Supplementary Note 11), 
indicating that the substrate sulphur substitution alone does not enable 
exon ligation of compromised spliceosomes. These results and a metal 
specificity switch induced by U80-PS(Sp) (Extended Data Fig. 8 and 
Supplementary Note 12) validate the evidence that the U6 A59 pro-Sp 
and U80 pro-Sp oxygens interact with a catalytic metal during exon 
ligation. 

Through a comprehensive analysis, encompassing every substrate 
allowed by sulphur chemistry (Extended Data Figs 5g and 9h), we 
have found that four of the five metal ligands in U6 position catalytic 
metals during the catalytic stages of splicing. Even though metal rescue 
strategies were insufficient to establish direct evidence linking the 
fifth ligand (G60 pro-Sp) to a splice site (Supplementary Note 2), the 
configuration of the RNA catalytic core implied by our data is consistent 
with the fifth ligand also binding a catalytic metal (see below). 
Importantly, at least two of the U6 metal ligands function in both steps 
of splicing and at least one of these, U80 pro-Sp, positions a catalytic 
metal during both branching and exon ligation (M2, Fig. 5a, b). These 
and other findings (Supplementary Note 13) support a model in 
which both splicing reactions are catalysed by a common, two-metal 
catalytic core, rather than two independent active sites, and thereby 
implicate a rearrangement of the substrate to sequentially accommodate 
mutually exclusive interactions between U6 and the 5′ and 3′ splice 
sites. Further underscoring the importance of disabling fidelity mechanisms 
in our approach, these results establish that the spliceosome 
proofreads catalytic interactions (compare with ref. 28). 


Discussion 


Our results indicate that the metals that mediate splicing catalysis 
through interactions with the scissile phosphates are bound by the 
spliceosome through ligands in the U6 snRNA, thus demonstrating 
that RNA directly mediates catalysis in the spliceosome (Fig. 5d). 
Notably, all of the five U6 metal ligands that function in pre-mRNA 
splicing (Fig. 2) correspond directly to catalytic metal ligands observed 
in domain V in structures of a group II intron (Figs 1b, c; 2n; 5c; refs 26, 27). 
Moreover, our findings imply an equivalent orientation of the substrate 
Figure 5 | Model for catalytic metal interactions during pre-mRNA splicing 
and comparison to the domain V catalytic core of group II introns. 
a, b, Catalytic metal interactions during branching (a) and exon 
ligation (b). c, Model of domain V during hydrolysis (PDB 4FAQ, ref. 27). 
d, Two-metal model for the RNA catalytic core of the spliceosome. For 
branching, R1 represents the 2′ hydroxyl of the branch adenosine; R2, the 
intron; and R3, the pro-Sp oxygen. For exon ligation, R1 represents the 3′ oxygen 
leaving group; R2, the pro-Sp oxygen; and R3, the 3′ exon. Throughout, the 
reactive oxygens are coloured red, the pre-mRNA scissile phosphate is depicted 
in a transition state, and interactions between specific ligands and the reactive 
oxygens mediated by M1 and M2 are shown as light magenta dashed lines. 


relative to the metal binding core (Fig. 5a–c). Taken together, our data 
support a model for a single, two-metal active site (Fig. 5d; Supplementary 
Note 13; refs 22, 40) and validate (compare with ref. 25) the long-standing 
hypothesis that the spliceosome shares a similar RNA-based 
catalytic core and mechanism with group II introns (Supplementary 
Note 14). This RNA-based mechanism is sufficient to 
effect metal catalysis of pre-mRNA splicing, without the need for direct 
protein involvement. 

Given this evidence, it is noteworthy that the RNaseH-like domain 
of Prp8, which interacts with all reactive sites of the substrate, can bind 
a metal in crystallo. Further, mutations that compromise metal binding 
in crystallo impair exon ligation, leading to the possibility that 
Prp8 may play a catalytic role during exon ligation. Nevertheless, we 
have not found any evidence for a direct metal-mediated catalytic 
interaction between the Prp8 metal binding site and the 3′ splice site 
(Supplementary Note 15; Extended Data Fig. 10). Thus, Prp8 may 
primarily promote formation of the RNA catalytic core without being 
a part of it; indeed, a recent structure comprising most of Prp8 has led 
to the suggestion that Prp8 forms a scaffold for the spliceosome’s 
catalytic core, analogously to protein co-factors of group II introns. 
A recent model of the catalytic core of the human spliceosome based 
on RNA structure probing and homology modelling provides further 
support for this interpretation. 

Overall, our data indicate that the spliceosome, like the ribosome, 
uses RNA to effect catalysis in the context of a complex ribonucleoprotein 
assembly. Moreover, the common catalytic mechanism used 
by the spliceosome and group II introns is consistent with a common 
evolutionary origin between the spliceosome and these ancient RNA 
retroelements. Our findings thus support the idea that modern 
ribonucleoprotein enzymes evolved from a primordial ‘RNA world’ 
(ref. 48), in which catalysis was performed exclusively by RNA. 
METHODS SUMMARY 


U6 was depleted from Saccharomyces cerevisiae splicing extracts and splicing activity 
reconstituted with synthetic U6 snRNA, essentially as described. Spliceosomes 
were assembled on modified model pre-mRNA substrates: UBC4, for experiments 
probing branching, or ACT1, for experiments probing exon ligation, both synthesized 
by splint-mediated ligation. Oligonucleotides containing specific 5′ or 3′ 
splice site modifications were synthesized in house, as described previously. Assembled 
spliceosomes were isolated by affinity purification via Prp19p (ref. 36), 
washed to remove ATP, and chased as described, at room temperature, in the 
absence of ATP, at pH 7.0, 8.0 or 8.5. All experiments were repeated with at least 
two independent extract preparations. Data were quantified using ImageQuant TL 
(Amersham Biosciences). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 

Strains. Experiments probing branching were performed using S. cerevisiae strain 
yJPS1405, which was derived from BY4741 (MATa his3Δ1 leu2Δ0 met15Δ0 
ura3Δ0, Open Biosystems), by integrating a biotinylation signal C-terminal to 
the PRP19 locus using KanMX6 as a selective marker; the integration fragment, 
including the biotinylation signal, an ADH1 terminator, and the KanMX sequences, 
was amplified from pFA6-HTB-KanMx (ref. 51). 

Experiments probing exon ligation were performed using the previously described 
S. cerevisiae strain yJPS860 containing a C-terminal TAP tag on Prp19 (ref. 34). 

Experiments involving Prp8 were performed using S. cerevisiae strains yJPS1471
(PRP8 wild-type) and yJPS1472 (PRP8 D1853C), both derived from yJPS860
(ref. 34). Briefly, the entire coding sequence of PRP8 was replaced in yJPS860 with
LEU2, after transformation with PRP8 on a URA3 plasmid (bJPS1874), yielding
yJPS1470. Strains yJPS1471 and yJPS1472 were then derived from yJPS1470 by
plasmid shuffle on 5-FOA after transformation with bJPS2263 (PRP8 wild-type on a
HIS3 plasmid, same as pJU186, ref. 52) or bJPS2589 (PRP8 D1853C). bJPS2589
was derived from pJU186 (ref. 52) by QuikChange mutagenesis and confirmed by
sequencing.
Pre-mRNA splicing substrates. Branching experiments were performed using 
UBC4 pre-mRNA truncated at the 5' and 3' ends down to 20-nucleotide exons.
Exon ligation experiments were performed using ACT1 pre-mRNA. Pre-mRNA
substrates were prepared by splint-mediated ligation using T4 RNA ligase 2
(NEB) or T4 DNA ligase (in house). 

Oligonucleotides UBC4-5'E, 5'Ev2, UBC4-M, and UBC4-M1 were purchased
from Dharmacon. Oligonucleotide UBC4 M2, bearing various modifications, was
synthesized in house and purified by HPLC on a DNAPac P-100 column using
the Waters 2795 system. Oligonucleotides ACT1-3'O-pc, ACT1-3'S-pc,
ACT1-UAc-3'O, and ACT1-UAc-3'S were synthesized in house. The correct identity
(position and chirality) of specific sulphur modifications was verified by mass
spectrometry, silver or iodine cleavage, and analytical T1 digestion (data not
shown). Oligos M, M1, and M2 were phosphorylated with unlabelled ATP before
ligation. The UBC4-TX transcript was synthesized by in vitro transcription using
a PCR-derived template that started transcription at position +37 of the intron
and ended at the last nucleotide of the exon; the template was generated using
primers T7 UBC4 37-135 F and HindIII UBC4 R. The transcript was gel purified,
treated with calf intestinal phosphatase (NEB), and 5' phosphorylated with
[γ-32P]ATP (Perkin Elmer, 6,000 Ci mmol⁻¹) before ligation.

The ACT1-5’-end plasmid was constructed by stepwise PCR from plasmid 
pJPS149 (ref. 34) to generate a DNA template containing an EcoRI site, the T7 
promoter, ACT1 nucleotides 1-373, an HDV ribozyme sequence and a HindIII
site. The template was then cloned into vector pUC19. The ACT1-1-373 tran- 
script was synthesized by in vitro transcription from the ACT1-5'-piece plasmid 
linearized with HindIII. In cases where HDV cleavage was inefficient during 
transcription, the RNA was resuspended in 10 mM Tris (pH 7.5) and 20 mM
MgCl2. Ribozyme cleavage was induced by 2-4 cycles of 90 °C for 1 min, room
temperature for 15 min and 37 °C for 15 min. The buffer conditions were then
adjusted for T4 PNK (NEB) treatment of the transcript to remove the 2'-3'-cyclic
phosphate left by the ribozyme. The ACT1-392-590 transcript was synthesized 
by in vitro transcription using a PCR-derived template generated using plasmid 
bJPS149 (ref. 34). As the subsequent ligation requires a 5’-monophosphate group, 
a fourfold excess of GMP over GTP was included in the transcription reaction. 

For a typical UBC4 ligation reaction, 20 pmol of UBC4 5'E, 10 pmol of UBC4
M, M1, or M2 and 20 pmol of UBC4 TX were hybridized to 10 pmol of UBC4
splint in buffer TEN (10 mM Tris-HCl, pH 7.5; 1 mM EDTA; 66 mM NaCl) on a
thermal cycler by heating to 90 °C for 2 min and then lowering the temperature
in 72 sequential steps of 1 °C, each held for 1 min, to a final temperature of
18 °C. T4 RNA ligase 2 (10-20 U) was then added and reactions were incubated at
37 °C for 6 h. Ligated, full-length UBC4 pre-mRNA was purified on an 8%
denaturing polyacrylamide gel and recovered by passive elution in TE buffer
(10 mM Tris-HCl pH 7.5; 1 mM EDTA) at 4 °C overnight.
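
The annealing ramp used for the UBC4 ligation can be written out as an explicit
list of thermal cycler steps. The following is a minimal sketch, not the
instrument programme actually used; the function name and its parameters are
hypothetical, and only the temperatures and hold times come from the text.

```python
def annealing_program(start_c=90, end_c=18, initial_hold_min=2, step_hold_min=1):
    """Return (temperature in deg C, hold in min) steps for the annealing ramp.

    Defaults follow the UBC4 protocol: 90 deg C for 2 min, then 1 deg C
    decrements held for 1 min each, down to 18 deg C.
    """
    steps = [(start_c, initial_hold_min)]
    # One step per degree below the starting temperature, ending at end_c.
    for temp in range(start_c - 1, end_c - 1, -1):
        steps.append((temp, step_hold_min))
    return steps


program = annealing_program()
ramp_steps = len(program) - 1                       # excludes the initial 90 deg C hold
total_minutes = sum(hold for _, hold in program)    # total time before ligase is added
```

With the default values the ramp contains 72 one-degree steps, matching the
protocol. The ACT1 variant (ramp to 24 °C) corresponds to `end_c=24`, followed
by a separate 5 min hold at 4 °C that this sketch does not model.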

For a typical ACT1 ligation reaction, 500 pmol of ACT1-1-373, 50 pmol of
oligonucleotide ACT1-3'O and 500 pmol of ACT1-392-590 were hybridized to
50 pmol of ACT1 splint in buffer TEN50 (10 mM Tris-HCl, pH 7.5; 1 mM EDTA;
50 mM NaCl) on a thermal cycler by heating to 90 °C for 2 min, lowering the
temperature in steps of 1 °C, each held for 1 min, to 24 °C, then cooling to 4 °C
for 5 min. T4 DNA ligase (~100 pmol, synthesized in house) was then added and
reactions were incubated at 37 °C for 4 h. The ligation reactions were DNase
treated (RNase-free DNase, Promega) for 15 min to remove splint,
phenol-chloroform extracted and ethanol precipitated before purification on a 6%
denaturing polyacrylamide gel. Bands containing full-length ACT1 pre-mRNA were
excised and recovered by passive elution in TEN250 buffer (10 mM Tris-HCl pH 7.5,
1 mM EDTA, 250 mM NaCl) overnight at 4 °C.
Splicing extracts and U6 depletion and reconstitution. Splicing extracts
were prepared using the liquid nitrogen method, as described. U6 depletion
and reconstitution were performed essentially as described, with the following
modifications. The U6 d1 oligonucleotide was titrated for each extract to optimize
depletion and reconstitution of U6; typically, 0.8 µM U6 d1 was optimal. After
depletion of U6, to promote degradation of the U6 d1 oligonucleotide before
addition of the synthetic U6, DNase I (Ambion, 0.05 U µl⁻¹ final) was added
together with the glucose used to deplete ATP. Additionally, to ensure complete
inactivation of the U6 d1 oligonucleotide, an oligonucleotide antisense to U6 d1
(U6 ad1, 0.3 µM final) was added immediately before reconstitution and
incubated on ice for 5 min. For reconstitution, U6 was added back to a final
concentration of 0.2-0.3 µM. Modified U6 was constructed by splint-mediated
ligation essentially as described. Wild-type U6 was synthesized by in vitro
transcription according to standard procedures using pJPS488 linearized with DraI
as template. There was no difference in splicing efficiency between spliceosomes
reconstituted with wild-type U6 made by ligation or by in vitro transcription
(data not shown and Extended Data Fig. 6b).

In vitro splicing. In vitro splicing was performed essentially as described using
32P-labelled substrates (0.2-0.4 nM). In Fig. 2c, g, extracts were pre-incubated
with 2 mM EDTA at 4 °C for 30 min before assembling the reactions; MgCl2 was
adjusted to 3.3 mM during the splicing reaction (compared to the standard 2.5 mM)
to compensate for the final EDTA concentration (0.8 mM). Reactions were usually
incubated in extract for 25 min before affinity purification of assembled
complexes, except for experiments in Fig. 3a and Extended Data Fig. 3, where
initial incubation was performed for 10 min. Assembled and stalled complexes
were affinity-purified, via tagged Prp19p, by incubating the splicing reactions for
1-3 h at 4 °C with a 10%-20% reaction volume of streptavidin-agarose (Thermo
Scientific) or IgG-Sepharose (GE) slurry pre-washed twice with 25-50 volumes of
IPP150 (10 mM Tris-HCl pH 8.0; 150 mM NaCl; 0.01% NP-40 substitute (Fluka)).
Following immunoprecipitation, beads were washed at 4 °C twice with 50 volumes
of buffer PK (3% PEG8000, 60 mM potassium phosphate, pH 7.0), and spliceosomes
were assayed for splicing in the absence of ATP (except where noted), in
buffer PK (pH 7.0, 8.0, or 8.5, as indicated) with various amounts of the indicated
metals and EDTA (where noted) at room temperature (23-24 °C) with constant
rotation, for 60 to 90 min. Metal solutions were prepared from corresponding
powdered salts of at least 99.99% purity (Sigma).

Affinity-purified spliceosomes were incubated as follows. In Figs 2c and 3a,
buffer PK (pH 8.5, with 0.5 mM EDTA) with 1 mM MgCl2 and 1 mM of the
indicated metals; in Fig. 2d-g, buffer PK (pH 8.5) with 0.5 mM MgCl2 and
0.5 mM of the indicated metals; in Figs 2j-l and 4b, c, buffer PK (pH 7.0) with
2 mM MgCl2 and 0.1 mM CdCl2; in Fig. 2k, l, spliceosomes containing A59-PS(Sp)
and U80-PS(Sp) were chased through branching in the presence of 0.1 mM CdCl2
before affinity purification. In Fig. 3b, e, buffer PK (pH 7.0) with 1 mM MnCl2
alone or in combination with 1 mM CdCl2; in Fig. 3c, buffer PK (pH 8.5) with
0.5 mM MgCl2 and 0.5 mM CdCl2; in Fig. 3d, buffer PK (pH 8.5) with 0.5 mM
MgCl2 and various amounts of CdCl2 as indicated. In Fig. 4a, buffer PK (pH 7.0)
with 1 mM MgCl2 and 0.1 mM CdCl2 or 0.1 mM MnCl2.

Following in vitro splicing, products were separated on 15% denaturing
polyacrylamide gels for UBC4 substrates or 6% denaturing polyacrylamide gels for
ACT1 substrates. All experiments were repeated with at least two independent
extract preparations. 
Cwc25p depletion and reconstitution. For depletion of Cwc25p, splicing extracts 
were prepared fresh, dialysed against buffer D (ref. 52), and then incubated at 4 °C 
for 1.5 h with 50% volume equivalent of protein A-sepharose (Sigma) (1:1 slurry in
IPP150) conjugated to anti-Cwc25 serum (gift from S.-C. Cheng, ref. 59) and
pre-equilibrated in buffer D by rotation at 4 °C for 15 min. Beads were removed by
centrifugation at 800g for 4 min and the supernatant was used as depleted extract. 

rCwc25p extended with a C-terminal six-histidine tag was expressed in E. coli
BL21(DE3)pLysS transformed with pET15b (ref. 59; a gift from S.-C. Cheng).
Induction in E. coli and purification by Ni2+-NTA affinity chromatography were
performed essentially as described, but binding and washing were performed
manually. Following elution from the Ni2+-NTA resin, the protein was further
purified by glycerol gradient centrifugation to more than 90% purity (as estimated
by Coomassie blue staining).

To prepare the heat-soluble extract fraction (HP), yeast splicing extracts were
incubated at 90 °C for 5 min and insoluble material was removed by
centrifugation at 16,000g for 5 min. The supernatant was used as HP.

For complementation of affinity-purified spliceosomes from extracts
immunodepleted of Cwc25p, spliceosomes were washed twice with 50 volumes of buffer
DK (20 mM HEPES-KOH, pH 7.9; 60 mM potassium phosphate, pH 7.0; 50 mM
NaCl; 0.2 mM EDTA) and splicing was assayed in buffer DK, in the presence of
rCwc25-6His (0.7-1.4 µM) and a heat-soluble extract fraction (HP; typically
1-2 µl for a 50 µl splicing reaction).

To determine the rates of branching in Extended Data Fig. 4d, reactions were
pre-incubated with rCwc25p and HP in the absence of metal at room temperature
for 5 min to allow binding and temperature equilibration. Splicing was initiated
by addition of metals; aliquots were removed at various times, immediately
quenched in STOP buffer (50 mM NaOAc, pH 5.2; 1 mM EDTA; 0.1% SDS;
0.1 mg ml⁻¹ glycogen) and placed on ice before phenol extraction.
Oligonucleotides. The following oligonucleotides were used for synthesis of
UBC4 substrates.
UBC4 5'E: 5'-GAACUAAGUGAUCUAGAAAGG-3';
UBC4 5'Ev2: 5'-GAACUAAGUGAUCUA-3';
UBC4 M: 5'-UAUGUCUAAAGUUAU-3';
UBC4 M1: 5'-GAAAG(3'S)GUAUGUCUAAAGUUAU-3';
UBC4 M2: 5'-GAAAG(3'S-PS)GUAUGUCUAAAGUUAU-3';
UBC4 Splint: 5'-CACGCATTTGAAACGTGGCCATAACTTTAGACATACCTTTCTAGATCACTTATTC-3' (ligation splint);
T7 UBC4 37-135 F: 5'-TAATACGACTCACTATAGGCCACGTTTCAAATGC-3' (forward primer to generate the template for UBC4 TX);
HindIII UBC4 R: 5'-ATAAGCTTAACATGAAGTAGGTGGATCTC-3' (reverse primer to generate the template for UBC4 TX).

The following oligonucleotides were used for synthesis of ACT1 substrates.
ACT1-3'O-pc: 5'-UUUA(2'-O-nitrobenzyl-G)AGGUUGCUGCUUU-3';
ACT1-3'S-pc: 5'-UUUA(3'S, 2'-O-nitrobenzyl-G)AGGUUGCUGCUUU-3';
ACT1-UAc-3'O: 5'-UUUACACGUUGCUGCUUU-3';
ACT1-UAc-3'S: 5'-UUUA(3'S-C)AGGUUGCUGCUUU-3';
ACT1-UAc-splint: 5'-GAACCGTTATCAATAACCAAAGCAGCAACGTGTAAACATATAATATAGCAACAAAAA-3';
ACT1-splint: 5'-GAACCGTTATCAATAACCAAAGCAGCAACCTCTAAACATATAATATAGCAACAAAAA-3';
act1-3'-end-for-1: 5'-TAATACGACTCACTATAGGTTATTGATAACGGTTATTG-3' (forward DNA primer to generate ACT1 392-590 transcription template);
act1-3'-end-for-2: 5'-GAAATTAATACGACTCACTATAGGTTATTG-3' (forward DNA primer to generate ACT1 392-590 transcription template);
act1-3'-end-rev: 5'-mUmUGGGCTGCAGGTCGAGCTC-3' (reverse DNA primer to generate ACT1 392-590 transcription template);
EcoRI+T7: 5'-AGTGAATTCCCTTAATACGACTCACTATAGG-3' (forward DNA primer to generate ACT1 1-373-HDV template);
Forward long: 5'-CGTAATACGACTCACTATAGGGCGAATTGG-3' (forward DNA primer to generate ACT1 1-373-HDV template);
Reverse-A-25: 5'-CGAGGAGGCTGGGAGCATGCCGGCCCATATAATATAGCAACAAAAAGAAT-3' (reverse DNA primer to generate ACT1 1-373-HDV template);
Reverse 0-50: 5'-CCGGAATGTTGCCCAGCCGGCGCCGCGAGGAGGCTGGGAGCATGCCGGCC-3' (reverse DNA primer to generate ACT1 1-373-HDV template);
Reverse 25-75: 5'-GTTGCGTCCCATTCGCCATTACCGGACGGTCCGGAATGTTGCCCAGCCGGCGCCG-3' (reverse DNA primer to generate ACT1 1-373-HDV template);
Reverse final: 5'-GTGCGTCCCATTCGCCATTACCCG-3' (reverse DNA primer to generate ACT1 1-373-HDV template).
Data analysis. Gels were dried, exposed to storage phosphor screens (Amersham
Biosciences) for 24 to 48 h, and scanned using a Typhoon Trio phosphorimager
(Amersham Biosciences). Bands were quantified using ImageQuant TL with an
automated rolling ball algorithm for background subtraction. The efficiency of
branching was calculated as LI/(LI + P) if EI represented less than 1% of all
species, or as (LI + EI)/(LI + EI + P) in all other cases, where LI is lariat
intermediate, EI is excised intron and P is pre-mRNA. The efficiency of exon
ligation was calculated as EI/(EI + LI).
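
The two efficiency definitions can be expressed as short functions operating on
background-subtracted band intensities. This is an illustrative sketch rather
than the analysis code used for the paper; the function names are hypothetical,
and "all species" is assumed here to mean LI + EI + P.

```python
def branching_efficiency(li, ei, p, ei_threshold=0.01):
    """Branching efficiency from lariat intermediate (li), excised intron (ei)
    and pre-mRNA (p) band intensities.

    Assumption: the '1% of all species' test uses li + ei + p as the total.
    """
    total = li + ei + p
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    if ei / total < ei_threshold:
        # Excised intron is negligible, so it is left out of the ratio.
        return li / (li + p)
    return (li + ei) / total


def exon_ligation_efficiency(li, ei):
    """Exon ligation efficiency, EI/(EI + LI)."""
    if li + ei <= 0:
        raise ValueError("li + ei must be positive")
    return ei / (ei + li)


# Hypothetical band intensities in arbitrary units.
no_intron = branching_efficiency(li=30.0, ei=0.0, p=70.0)
with_intron = branching_efficiency(li=30.0, ei=10.0, p=60.0)
ligation = exon_ligation_efficiency(li=30.0, ei=10.0)
```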

The rescue midpoint for the metal titration curves in Fig. 3d and Extended Data
Fig. 6c, d was obtained by fitting the rescue profiles to the general Hill equation
$A = A_{\mathrm{max}} x^n/(x^n + K^n)$, where $A_{\mathrm{max}}$ is the splicing
efficiency at saturation, $K$ is the rescue midpoint, $x$ is the CdCl2
concentration, and $n$ is the Hill coefficient.
Curves in Fig. 3e and Extended Data Fig. 7a were fit to the following equations:
$y = E_0 + E x/(x + K_{\mathrm{Cd}})$ (1 metal model) or
$y = E_0 + E x^2/(x^2 + K_{\mathrm{Cd}}^2)$ (2 metal model), where $y$ is the
branching efficiency, $E_0$ is the extent of branching in the absence of Cd2+,
$E$ is the extent of branching at saturating Cd2+, and $K_{\mathrm{Cd}}$ is the
apparent transition midpoint for Cd2+ binding. Fits using the general Hill
equation $y = E_0 + E x^n/(x^n + K_{\mathrm{Cd}}^n)$, where $n$ is the number of
metal sites titrated, gave the same number of sites (within fit error) as those
assuming a fixed $n$ (data not shown).
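
As an illustration of how a rescue midpoint and Hill coefficient relate to a
titration curve, the sketch below evaluates the general Hill equation and
recovers K and n from a linearized Hill plot, ln[A/(Amax - A)] = n ln x - n ln K.
The fitting software and procedure used for the paper are not stated here, so
this linearization (with Amax assumed known) is a simplified stand-in for a
nonlinear least-squares fit. All function names and data values are hypothetical.

```python
import math


def hill(x, a_max, k, n):
    """General Hill equation A = Amax * x^n / (x^n + K^n)."""
    return a_max * x**n / (x**n + k**n)


def fit_hill_linearized(conc, eff, a_max):
    """Estimate (K, n) by ordinary least squares on the Hill plot.

    Points with x <= 0 or with efficiency outside (0, a_max) are skipped
    because the logarithms are undefined there.
    """
    xs, ys = [], []
    for x, a in zip(conc, eff):
        if x > 0 and 0 < a < a_max:
            xs.append(math.log(x))
            ys.append(math.log(a / (a_max - a)))
    if len(xs) < 2:
        raise ValueError("need at least two usable titration points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    n = sxy / sxx                      # slope of the Hill plot
    intercept = mean_y - n * mean_x    # equals -n * ln K
    k = math.exp(-intercept / n)
    return k, n


# Synthetic, noise-free titration (concentrations in mM); not measured data.
conc = [0.001, 0.003, 0.01, 0.03, 0.1, 0.3]
eff = [hill(x, a_max=0.4, k=0.02, n=2.0) for x in conc]
k_fit, n_fit = fit_hill_linearized(conc, eff, a_max=0.4)
```

On real data the linearized form weights points near 0 and near Amax heavily,
which is one reason a direct nonlinear fit is usually preferred.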

Initial rates in Extended Data Figs 4d and 6a, b were obtained by fitting the
linear portion of the splicing time courses to the equation
$A = A_0 + k_{\mathrm{initial}} t$, where $A$ is the splicing efficiency at time
$t$ and $A_0$ is the extent of splicing at time 0. Rates in Extended Data Fig. 9
were obtained by fitting the splicing time courses to the equation
$E = E_0 + A(1 - e^{-kt})$, where $E$ is the extent of reaction, $E_0$ is the
extent at time 0, $A$ is the amplitude, $k$ is the rate of reaction, and $t$ is
the time in minutes.
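
The two rate models can be sketched as follows: an ordinary least-squares line
through the early, linear part of a time course gives the initial rate, and the
single-exponential expression gives the extent of reaction at any time. The
function names and the time-course values are hypothetical and serve only to
illustrate the equations; this is not the fitting routine used for the paper.

```python
import math


def linear_fit(times, values):
    """Least-squares fit of A = A0 + k_initial * t; returns (A0, k_initial)."""
    if len(times) != len(values) or len(times) < 2:
        raise ValueError("need at least two matched (time, value) points")
    mean_t = sum(times) / len(times)
    mean_a = sum(values) / len(values)
    stt = sum((t - mean_t) ** 2 for t in times)
    sta = sum((t - mean_t) * (a - mean_a) for t, a in zip(times, values))
    k_initial = sta / stt
    a0 = mean_a - k_initial * mean_t
    return a0, k_initial


def first_order_extent(t, e0, amplitude, k):
    """Extent of reaction E = E0 + A * (1 - exp(-k * t)), with t in minutes."""
    return e0 + amplitude * (1.0 - math.exp(-k * t))


# Synthetic early time points (minutes) on an exactly linear course.
times = [0.0, 2.0, 4.0, 6.0]
values = [0.02 + 0.01 * t for t in times]
a0, k_initial = linear_fit(times, values)

# The exponential model starts at E0 and approaches E0 + A at long times.
start = first_order_extent(0.0, e0=0.05, amplitude=0.3, k=0.1)
plateau = first_order_extent(1e6, e0=0.05, amplitude=0.3, k=0.1)
```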

Extended Data Figure 1 | Affinity purification of spliceosomes with Prp19
requires reconstitution with U6 snRNA and enhances the potential to detect
rescue by thiophilic metals. a, b, Schemes depicting experimental strategy for
staging spliceosomes to monitor branching (a) or exon ligation (b) in the
absence of ATP. Spliceosomes are depicted as light magenta ovals and Prp19p
as a magenta circle. Following affinity purification of spliceosomes by Prp19p
(ref. 36), beads were washed to remove ATP and soluble factors, and metal ions
were added to assay for splicing. c, Prp19p-mediated affinity purification of
activated spliceosomes, reflected by immunoprecipitation of pre-mRNA, is
specific for properly reconstituted complexes. Note that the affinity
purification allows quantification of the branching efficiency for activated
complexes, independently of any effects on assembly. RNA from 10% of the
reaction (input, I) or from beads after affinity purification (B) was extracted
and analysed by denaturing PAGE. Raw data (top); quantification of
immunoprecipitation efficiency (bottom); Rec., reconstitution. d, ATP
represses the Cd2+ rescue for G78-PS(Sp) spliceosomes. Spliceosomes were
assayed as in Fig. 2e; for lane 4, 2 mM ATP-Mg2+ was also present during the
incubation. Representative gel (left) and quantification of the extent of ATP
repression (right). Values are averages; error bars represent s.d. (n = 3).

Extended Data Figure 2 | Broad rescue specificity of A59-PS(Sp) and
U80-PS(Sp) spliceosomes. a, Pre-incubation with EDTA reveals a branching defect
for A59-PS(Sp). Extracts were pre-incubated at 4 °C in the presence or absence
of 2 mM EDTA and then incubated with the 3'O substrate. After affinity
purification, branching efficiency was quantified without further incubation
(bottom). A representative gel is shown (top). EDTA pre-incubation caused an
eightfold reduction in branching efficiency, indicating that splicing extracts
may contain a thiophilic metal that supports branching by A59-PS(Sp)
spliceosomes; note that pre-mRNA still immunoprecipitated efficiently,
indicating catalytic activation of the spliceosome. In contrast, EDTA
pre-incubation has no effect on U6 wild-type spliceosomes (data not shown). b, The
branching defects for A59-PS(Sp) and U80-PS(Sp) spliceosomes are rescued by
either Mn2+ or Cd2+. Assays were as in Fig. 2c. A representative gel (top) and
quantification (bottom) are shown; no inc, no incubation. Both Mn2+ and
Cd2+ strongly rescue A59-PS(Sp) and U80-PS(Sp) spliceosomes (lanes 3, 4, 7
and 8), suggesting that even the weaker Mn2+-S interaction at these positions
can support branching. This broad specificity for branching may also explain
why A59-PS(Sp) and U80-PS(Sp) spliceosomes also catalysed exon ligation in
the presence of thiophilic metals, whereas G60-PS(Rp), G78-PS(Sp), and
U80-PS(Rp) spliceosomes, for which branching was only rescued in the presence of
Cd2+, stalled after branching. Values are averages; error bars, s.d. (n = 3).

Extended Data Figure 3 | A divalent metal binds the 5' splice site pro-Rp
oxygen during branching. a, UBC4 pre-mRNAs bearing the indicated
modifications at the 5' splice site (top panel) were assayed as in Fig. 3a. The
band marked * results from 5' to 3' exonucleolytic degradation that is blocked
by the sulphur; ** denotes statistical significance of Cd2+ rescue compared to
Mg2+ splicing (P = 0.004, paired, one-tailed t-test, n = 3). The data from
Fig. 3a are reproduced here to aid comparison. Values are averages; error bars,
s.d. (n = 3); no inc, no incubation. b, Mapping of the site of branching.
Purified, 32P-labelled, intact lariat intermediates (LI ~0.2 fmol) resulting from
branching of the UBC4 3'O-PO or 3'S-PS(Rp) substrates were used as
templates for reverse transcriptase (RT) using primer IP3 (~10 fmol), which
binds at nucleotides +5 to +24 of the intron (see lower diagram). Lanes 6 and 7
show that the major RT stop occurs at the same position when either the
3'O-PO or 3'S-PS(Rp) lariats are used as template. This stop migrates at
the expected position, which is the position of the ddC stop resulting
from extension of primer IP3 with pre-mRNA as template and therefore
corresponds to position +1 of the intron, the expected branch site. Lower
diagram shows mapping of the primer and expected RT stop onto the UBC4
pre-mRNA sequence; bold, exon; italics, intron. Note that the band marked ***
is present in the 32P pre-mRNA lane and thus is probably a nonspecific band
resulting from contamination with pre-mRNA degradation products that can
anneal to the primer and serve as template.

Extended Data Figure 4 | The UBC4 3'S substrate can be branched
efficiently only in the presence of thiophilic metals. a, A sulphur at the 5'
splice site leaving group alone blocked splicing in extract. Following affinity
purification, branching efficiency was quantified (bottom) without further
incubation. Note that immunoprecipitation of pre-mRNA indicated that the
sulphur did not block the catalytic activation of the spliceosome. b, Thiophilic
metals rescued the branching defect of the 3'S substrate. Spliceosomes were
assayed in buffer PK (pH 7.0) in the presence of 2 mM total metal (1 mM MgCl2
plus 1 mM MnCl2 or 1 mM CdCl2) for 1.5 h. The bar graph quantifies the
relative stimulation by specific metals (normalized to Mg2+). c, d, Thiophilic
metals specifically stimulate branching of the 3'S, but not the 3'O substrate.
Affinity-purified spliceosomes from extracts depleted of Cwc25p, to stall
spliceosomes independent of the sulphur substitution, were incubated as in
Fig. 3a except that washes and incubation were done in buffer DK without
EDTA and with 1 mM MgCl2; rCwc25 as well as an HP extract fraction were
also added to complement (Supplementary Note 4; Methods). Quantification
of the thiophilic metal stimulation relative to Mg2+ is shown both for reaction
endpoints (c) and for the rate of branching (d). Values are averages; error bars,
s.d. (n = 3). The band marked * results from 5' to 3' exonucleolytic degradation
that is blocked by the sulphur.

Extended Data Figure 5 | The 3'S-PS(Rp) substrate specifically improves
rescue for spliceosomes containing U6 sulphur substitutions that
compromise catalytic metal binding. a, b, Spliceosomes were assayed as in
Fig. 3b. Note that the 3'S-PS(Sp) substrate does not significantly improve Cd2+
rescue when compared to the 3'O-PO substrate (a), despite having similar
reactivity to the 3'S-PS(Rp) substrate with wild-type U6 (b). c, The U6 double
mutation U80g/C61g permitted both spliceosome assembly and activation, as
reflected by the stable association of Prp19p with the splicing substrate. 10% of
the RNA in the input (I) for the immunoprecipitation or 100% of the RNA
associated with affinity-purified spliceosomes (B) was analysed by denaturing
PAGE (top). Total immunoprecipitation efficiency was quantitated for all
splicing species combined (bottom). d, The 3'S-PS(Rp) substrate did not
significantly improve splicing for U80g and U80g/C61g spliceosomes. Assays
were as in Fig. 3b. e, f, Representative raw data for Fig. 3. Assays were as in Fig. 3.
In e, for wild-type U6, lanes 1-3 and 4-6 were taken from two different gels; for
G78-PS(Sp), lanes 7-12 and 13-15 were taken from two different gels. In all
other cases the lanes for different substrates assembled with spliceosomes
bearing the same U6 modification were taken from the same gel. Values are
averages; error bars, s.d. (n = 3 for a, b, d; n = 2 for c); no inc., no incubation.
g, Summary of combinations of sulphur substitutions in U6 and the substrate
tested for rescue of branching. The + sign indicates that branching was
observed in the presence of thiophilic metal.

Extended Data Figure 6 | U6 snRNA positions catalytic metals during
branching: controls for the U80-PS(Sp)-induced shift in the Cd2+ transition
midpoint for rescue of the 3'S-PS(Rp) substrate. a, The shift induced by
U80-PS(Sp) in the Cd2+ midpoint for rescue was also observed when reaction rates,
instead of amplitudes, were compared. Initial rates are plotted versus CdCl2
concentration. Assays were as in Fig. 3d. Values are averages; error bars
represent s.d. (n = 2). Initial rates, rather than apparent overall rates, were used
here because the branching efficiency did not level off by 120 min at Cd2+
concentrations below 0.025 mM (assuming an endpoint of ~0.4). In support of
this approach, at saturating Cd2+ both wild-type and U80-PS(Sp) spliceosomes
branched similar fractions of the 3'S-PS(Rp) substrate (see c), indicating that
addition of a sulphur in U6 within the catalytic core did not necessarily alter the
population of complexes that are competent for catalysis. b, Transcribed (txt.)
and ligated (lig.) wild-type U6 behaved similarly relative to U80-PS(Sp) for
branching of the 3'S-PS(Rp) substrate at a limiting Cd2+ concentration. Initial
rates are shown for branching of the 3'S-PS(Rp) substrate in the presence of
0.01 mM Cd2+. c, A sulphur at U80 pro-Sp shifts the Cd2+ titration midpoint
for rescue of the 3'S-PS(Rp) substrate relative to the 3'S-PS(Sp) substrate.
Assays were as in Fig. 3d; the 3'S-PS(Rp) data from Fig. 3d are shown again here
for comparison. Curves represent Hill fits to the data. Values are averages; error
bars, s.d. (n = 3). Although the Cd2+ titration for rescue of U80-PS(Sp)
spliceosomes assembled on the 3'S-PS(Sp) substrate did not plateau under our
experimental conditions (panel c), the data nevertheless set a lower limit for the
apparent transition midpoint; further, this midpoint is equal to or greater
than that observed for U6 wild-type spliceosomes, indicating that the shift by
U80-PS(Sp) of the Cd2+ transition midpoint for rescue was specific for the
3'S-PS(Rp) substrate. d, With the 3'S-PS(Rp) substrate, U80-PS(Sp) decreased
the Cd2+ titration midpoint for rescue of branching by sixfold. The apparent
midpoint for G78-PS(Rp) spliceosomes is shown as an additional specificity
control for the shift observed for U80-PS(Sp) (actual titration data not shown).
The apparent Cd2+ rescue midpoints were obtained by fitting titration
curves to the general Hill equation (see Methods). Error bars represent error of
the Hill fit. e, Representative raw data for Fig. 3d. Assays were in the presence of
0.01 mM Cd2+ when indicated.




[Extended Data Figure 7 image. Panels a-d: Cd2+ titration curves (x axis, Cd2+ in mM, 0.0001 to 1) and bar graphs at low (0.01 mM) and high (1 mM) Cd2+ for U6 WT, G78-PS(Sp), U80-PS(Rp) and U80-PS(Sp) spliceosomes. Panels e-j: schematics of metal binding at sites M1 and M2 for U6 WT, U80-PS(Rp) and U80-PS(Sp) in Cd2+/Mg2+ or Cd2+/Mn2+.]


Extended Data Figure 7 | Further evidence for two distinct catalytic metal
sites during branching. a, Branching requires two catalytic divalent metals.
b-d, The 5' splice site pro-Rp oxygen and the U80 pro-Sp oxygen interact with a
metal distinct from the metal that interacts with the 5' splice site pro-Rp oxygen
and the U80 pro-Rp and G78 pro-Sp oxygens. See Supplementary Note 7 for
discussion. Spliceosomes were assayed for branching as in Fig. 3e; where
indicated, MgCl2 or MnCl2 were present at 1 mM. Curves represent Hill fits to
the data. The data in a are reproduced here from Fig. 3e to aid comparison. In
panels a-d, values are averages; error bars, s.d. (n = 3). Cd2+ was limiting in
b to sensitize the assay to binding of a second metal, and Cd2+ was saturating in
c to show that G78-PS(Sp) and U80-PS(Sp) have the potential to be rescued at
levels comparable to wild-type U6. Panel d shows that U80-PS(Sp) eliminates
the effect of Mn2+ on the titration curves. e-j, Metal binding during branching
for different combinations of sulphur substitutions in the substrate and
U6 in the presence of the indicated metals, as reflected by the data in panels
a-d. Panels e and f reflect data in a, b and c; panels g and h reflect data in b and
c; panels i and j reflect data in b, c and d. Relevant U6 ligands are coloured red in
each panel; the nucleophile is coloured orange. Metals are coloured magenta
(Cd2+) and blue (Mn2+), and their interactions with specific U6 ligands are
depicted as dashed lines, with differential shading intensity meant to illustrate
differences in the expected strength of interaction with oxygen compared with
sulphur, as inferred from studies with model compounds. Shading of metals
bound at M1 and M2 is further adjusted to reflect experimental observations.
Panels for G78-PS(Sp) would look the same as those for U80-PS(Rp) (g and h).

Extended Data Figure 8 | A sulphur at the 3' splice site but not the 5' splice
site leaving group alters the metal specificity for rescue of U80-PS(Sp)
spliceosomes: further evidence that U80 interacts with a catalytic metal
during exon ligation. a, A sulphur at the 3' splice site leaving group during
exon ligation alters the metal specificity for rescue of U80-PS(Sp) spliceosomes.
Assays were as in Fig. 2j. The bar graph depicts quantification of exon
ligation efficiency. b, A sulphur at the 5' splice site leaving group during
branching does not alter the metal specificity for rescue of U80-PS(Sp)
spliceosomes. Assays were as in Fig. 3a, except with the 3'S-PO substrate.
Values are averages; error bars, s.d. (n = 3). c-f, Inferred metal binding during
exon ligation (c, d) and branching (e, f) for the indicated combinations of
sulphur substitutions in the substrate and U6 in the presence of the indicated
metals. Relevant U6 ligands are coloured red in each panel; the nucleophile
is coloured orange. Metals and ligand interactions are coloured as in Extended
Data Fig. 7.
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Extended Data Figure 9 | Thiophilic metals rescue exon ligation for 
substrates containing a sulphur at the 3’ splice site leaving group in both a 
mutated and wild-type 3’ splice site context. a, Cd°* specifically stimulates 
the rate of exon ligation for the mutated, UAc-3’S substrate. Assays were as in 
Fig. 4a. b, Exon ligation occurs at the correct site for the 3’S substrate. RNA 
from affinity-purified spliceosomes assembled on ACT1-3'O, ACT1-3'S, and 
ACT1-UAc-3'O and chased as in Fig. 4a was subjected to RT-PCR using the 
primers depicted in blue arrows. c, Diagram of the photocaged linkage at the 3’ 
splice site. d-f, Cd’* specifically stimulates the rate of exon ligation for the 
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wild-type, UAG-3’S substrate. Assays were as in Fig. 4a, except that before 
addition of divalent metals, samples were irradiated with 308 nm light 

on ice for 5 min to remove the photocage. Shown are a representative gel 

(d), quantification of the reaction end points (e) and quantification of reaction 
rates (f). g, Representative raw data for Fig. 4b; bands within each set came 
from non-adjacent lanes on the same gel. Values are averages; error bars, s.d. 
(n = 3). h, Summary of combinations of sulphur substitutions in U6 and the 
substrate tested for rescue of exon ligation. The + sign indicates that exon 
ligation was observed in the presence of thiophilic metal. 

Extended Data Figure 10 | Residue D1853 of the RNaseH-like domain of 
Prp8 does not have a direct role in metal-mediated catalysis of exon ligation. 
a, b, Spliceosomes assembled on the indicated ACT1 UAG (a) or UAc (b) 3’O 
or 3’S substrates were assayed as in Fig. 2j, in the absence of ATP. Splicing 
extracts were prepared from either a wild-type PRP8 strain or a mutant strain 
having the prp8-D1853C mutation. See Supplementary Note 15 for details and a 
discussion. Values are averages; error bars, s.d. (n = 2); no inc, no incubation. 

The trajectory, structure and origin of the 
Chelyabinsk asteroidal impactor 


Jiří Borovička, Pavel Spurný, Peter Brown, Paul Wiegert, Pavel Kalenda, David Clark & Lukáš Shrbený 


Earth is continuously colliding with fragments of asteroids and 
comets of various sizes. The largest encounter in historical times 
occurred over the Tunguska river in Siberia in 1908, producing¹,² 
an airburst of energy equivalent to 5-15 megatons of trinitrotoluene 
(1 kiloton of trinitrotoluene represents an energy of 4.185 × 10¹² 
joules). Until recently, the next most energetic airburst events occurred 
over Indonesia³ in 2009 and near the Marshall Islands⁴ in 1994, both 
with energies of several tens of kilotons. Here we report an analysis of 
selected video records of the Chelyabinsk superbolide⁵ of 15 February 
2013, with energy equivalent to 500 kilotons of trinitrotoluene, and 
details of its atmospheric passage. We found that its orbit was similar 
to the orbit of the two-kilometre-diameter asteroid 86039 (1999 NC43), 
to a degree of statistical significance sufficient to suggest that the 
two were once part of the same object. The bulk strength (the 
ability to resist breakage) of the Chelyabinsk asteroid, of about 
one megapascal, was similar to that of smaller meteoroids⁶ and 
corresponds to a heavily fractured single stone. The asteroid broke 
into small pieces between the altitudes of 45 and 30 kilometres, 
preventing more-serious damage on the ground. The total mass of 
surviving fragments larger than 100 grams was lower than expected⁷. 

The data for Tunguska are limited to tree damage and records of 
seismic or acoustic waves at large distances. The Indonesia and Marshall 
Islands impacts were detected only by distant infrasonic stations or 
satellites in orbit and were therefore poorly documented. Some camera 
data exist for the multikiloton Sutter’s Mill event⁸, but precise analyses 
of the ablation process based on imaging have been made only for 
impacts of metre-sized bodies with energies of ~0.1 kt trinitrotoluene⁹,¹⁰ 
and less¹¹. 

The Chelyabinsk impact occurred unexpectedly over a relatively 
densely populated Russian region during sunrise on 15 February 2013. 
The superbolide (an extremely bright meteor) generated a damaging 
air blast wave. An 8-m-wide hole in the ice of Lake Chebarkul, 70 km 
west of Chelyabinsk, was reported shortly after the event. Thousands 
of small meteorites of total mass >100 kg, classified as LL5 ordinary 
chondrites, were found in the areas south-southwest of Chelyabinsk”’ : 

Here we determine the bolide trajectory and orbit and describe the 
ablation process of the asteroid. The main data for these analyses were 15 
bolide videos publicly available on the internet (Extended Data Table 1). 
We calibrated these videos with wide-field stellar imagery. Details of our 
procedure, which was based on the least-squares method”, are given in 
Supplementary Information. The trajectory and speed of the bolide are 
presented in Table 1. The observed low deceleration provides an extreme 
lower limit of 10⁶ kg for the mass of the body. The measured energy® and 
speed provide a best estimate of the mass of ~1.2 × 10⁷ kg, corresponding 
to a diameter of ~19 m assuming a bulk density of 3,300 kg m⁻³. 
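
The mass and diameter quoted above follow from elementary kinematics. The sketch below is illustrative only: it assumes that the full 500 kt is kinetic energy at the entry speed listed in Table 1 and that the body is a homogeneous sphere. The function names are hypothetical and are not taken from the paper.

```python
import math

KT_TNT_J = 4.185e12  # joules per kiloton of TNT, as defined in the text


def mass_from_energy(energy_kt, speed_km_s):
    """Mass (kg) of a body with kinetic energy energy_kt moving at speed_km_s."""
    v = speed_km_s * 1e3  # convert km/s to m/s
    return 2.0 * energy_kt * KT_TNT_J / v**2


def diameter_of_sphere(mass_kg, density_kg_m3):
    """Diameter (m) of a homogeneous sphere of the given mass and density."""
    return (6.0 * mass_kg / (math.pi * density_kg_m3)) ** (1.0 / 3.0)


# Inputs from the text: 500 kt, 19.03 km/s, bulk density 3,300 kg/m^3
mass = mass_from_energy(500.0, 19.03)
diameter = diameter_of_sphere(mass, 3300.0)
print(f"mass ~ {mass:.2e} kg, diameter ~ {diameter:.1f} m")
```

With these inputs the sketch returns a mass close to 1.2 × 10⁷ kg and a diameter of about 19 m, consistent with the values quoted.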

The pre-impact orbit (Table 2) is consistent with an origin in the main 
asteroid belt, most probably in the inner main belt near the ν₆ secular 
resonance. We integrated the orbit and 1,000 test particles within the 
orbital uncertainties (a probability cloud) 2,000 years into the past. The 
asteroid spent the six weeks before impact within an elongation of 45° 
from the Sun, a region of the sky inaccessible to ground-based 
telescopes. At earlier times, the asteroid was always too faint to be seen 
when some portion of the probability cloud was in the field of view of 
existing asteroid surveys. We note that the 2.2-km-diameter’* 
near-Earth asteroid 86039 (1999 NC43) of spectral type Q¹⁶ (corresponding 
to ordinary chondrites) has a very similar orbit, with very low 
dissimilarity criteria, D = 0.050 (ref. 17) and D′ = 0.018 (ref. 18), relative to 
the Chelyabinsk asteroid. Though this does not provide an unequivocal 
dynamical link, such a close match is unlikely statistically. We expect 
227 near-Earth asteroids brighter than 86039 to exist¹⁹. Selecting at 
random from the expected distribution of near-Earth asteroids²⁰, it 
takes an average of 6 × 10⁵ draws before selecting one with a smaller 
D value, and more than 3 × 10⁶ draws before selecting one with a smaller 
D′ value. Because 227/600,000 and 227/3,000,000 are equivalent to 
1:2,600 and 1:13,000, respectively, we conclude that there is an 
approximately 1:10,000 chance that the proximity of these orbits is due purely 
to chance. The two orbits have maintained two intersection points over 
the past 2,000 years, one near perihelion and one near aphelion (Extended 
Data Fig. 1). The minimum velocity kick required to eject the Chelyabinsk 
asteroid from 86039 is 0.7 km s⁻¹ (aphelion) or 2 km s⁻¹ (perihelion). 
This ejection velocity is consistent with a collision with another 
asteroid (which would provide a kick of a few kilometres per second). 
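
The chance-coincidence odds quoted above can be reproduced with a short calculation. The sketch assumes that each of the 227 expected bright near-Earth asteroids is an independent draw whose probability of a closer match is one over the mean number of draws. The helper name is hypothetical.

```python
N_BRIGHTER = 227  # expected near-Earth asteroids brighter than 86039 (from the text)


def chance_odds(n_candidates, mean_draws):
    """Return N such that the chance of a coincidental match is about 1:N."""
    return mean_draws / n_candidates


odds_d = chance_odds(N_BRIGHTER, 6e5)        # criterion D
odds_d_prime = chance_odds(N_BRIGHTER, 3e6)  # criterion D'
print(f"D: about 1:{odds_d:,.0f}; D': about 1:{odds_d_prime:,.0f}")
```

Rounded to two significant figures these are the 1:2,600 and 1:13,000 odds given in the text, which the authors summarize as approximately 1:10,000.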
The fragmentation during atmospheric entry was studied using the 
bolide light curve’ and deceleration with a procedure developed recently". 
The arrival times of secondary sonic booms heard on videos were also 
used to locate fragmentation points (the primary blast wave was caused 
by the cylindrical shock from the trajectory’). The dynamic pressure 


Table 1 | Trajectory of the Chelyabinsk superbolide 

Time (s)   Longitude (°)   Latitude (°)   Height (km)   Speed (km s⁻¹) 
 1.07      64.477          54.454         95.0          19.03 
 6.97      62.888          54.664         60.0          19.05 
10.46      61.933          54.780         40.0          19.03 
12.24      61.442          54.837         30.0          18.9 
13.18      61.193          54.864         25.0          18.0 
14.18      60.943          54.892         20.0          14.2 
15.17      60.802          54.907         17.2           6 

Fragment F1 
14.32      60.945          54.893         20.0          13.5 
16.04      60.704          54.922         15.0           6.4 
17.80      60.5883         54.9361        12.57          3.2 


Time zero corresponds to approximately 3:20:20 UT. Coordinates are given in the WGS84 geoid system. 
Speed is relative to the Earth’s surface. At the beginning, the acceleration due to gravity was larger than 
deceleration due to atmospheric drag. The beginning speed (19.03 ± 0.13 km s⁻¹) remained constant 
to within 0.02 km s⁻¹ down to an altitude of 35 km. The bolide was recorded in the analysed videos 
between altitudes of 95.1 and 12.6 km, over a trajectory of total length 272 km. The apparent radiant 
was changing owing to Earth’s gravity, starting at right ascension α = 21 h 53 min 7 s ± 24 s 
(328.28° ± 0.10°) and declination δ = +7° 28′ ± 3′ (+7.47° ± 0.05°) at altitude 95 km and changing to 
α = 21 h 51 min 58 s (327.99°), δ = +7° 53′ (7.88°) at altitude 20 km. These values correspond to a local 
radiant azimuth of A = 103.50° ± 0.09° (east of north) and elevation h = 18.55° ± 0.08°, and, 
respectively, A = 100.63° and h = 16.95° (we note that A and h change also as a result of Earth’s 
curvature). For comparison, the radiant of fragment F1 at 20 km was A = 101.87° ± 0.4°, 
h = 17.53° ± 0.3°. 

Table 2 | Pre-impact orbit of Chelyabinsk asteroid and the orbit of asteroid 86039 (1999 NC43) 

                                         Chelyabinsk                   86039 
Semimajor axis, a (AU)                   1.72 ± 0.02                   1.759621064 ± 6 × 10⁻⁹ 
Perihelion distance, q (AU)              0.738 ± 0.002                 0.7403896 ± 1 × 10⁻⁷ 
Eccentricity, e                          0.571 ± 0.006                 0.57923346 ± 6 × 10⁻⁸ 
Inclination, i (°)                       4.98 ± 0.12                   7.12312 ± 1 × 10⁻⁵ 
Longitude of the ascending node, Ω (°)   326.459 ± 0.001               311.81880 ± 5 × 10⁻⁵ 
Argument of perihelion, ω (°)            107.67 ± 0.17                 120.55894 ± 5 × 10⁻⁵ 
Time of perihelion passage               2012 December 31.39 ± 0.17    2014 January 17.45565 ± 2 × 10⁻⁵ 

The geocentric radiant and velocity of Chelyabinsk were αg = 22 h 11 min 17 s ± 26 s (332.82° ± 0.11°), δg = +0° 17′ ± 8′ (+0.28° ± 0.14°), vg = 15.14 ± 0.16 km s⁻¹. The orbit was obtained by numerical 
integration back in time to 60 d pre-impact, with the RADAU integrator®*°. The orbit of 86039 was taken from NASA JPL Database (http://ssd.jpl.nasa.gov/sbdb.cgi) on 7 June 2013. All angular coordinates are in 
equinox J2000.0. 


acting on the body was computed at each fragmentation point to 
evaluate the mechanical strength of the body. 
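
To illustrate how a dynamic pressure can be attached to a fragmentation point, the sketch below uses the common definition p = ρv² together with a crude isothermal atmosphere. The sea-level density and the 7-km scale height are assumptions made here for illustration; they are not the atmospheric model used in the paper, so only rough agreement with the quoted pressures should be expected.

```python
import math

RHO0 = 1.225    # assumed sea-level air density, kg m^-3
SCALE_H = 7.0   # assumed density scale height, km (isothermal atmosphere)


def air_density(height_km):
    """Air density (kg m^-3) at height_km in the assumed exponential atmosphere."""
    return RHO0 * math.exp(-height_km / SCALE_H)


def dynamic_pressure_mpa(height_km, speed_km_s):
    """Dynamic pressure p = rho * v^2, returned in MPa."""
    v = speed_km_s * 1e3  # convert km/s to m/s
    return air_density(height_km) * v**2 / 1e6


# Heights and speeds taken from Table 1 (19.03 km/s held constant above 35 km)
for h, v in [(45.0, 19.03), (40.0, 19.03), (30.0, 18.9), (20.0, 14.2)]:
    print(f"{h:4.0f} km  {v:5.2f} km/s  p ~ {dynamic_pressure_mpa(h, v):5.1f} MPa")
```

Under these assumptions the pressure at 45 km comes out near 0.7 MPa, and the values between 40 and 30 km are of the same order as the range reported for the main fragmentation series.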

The first significant mass loss occurred at an altitude of about 45 km, 
under a dynamic pressure of 0.7 MPa. The series of most severe frag- 
mentations occurred at altitudes between 40 and 30 km (1-5 MPa). 
Acoustic analysis revealed 11 individual fragmentations between alti- 
tudes 39.2 and 29.8 km (Extended Data Fig. 2), the two strongest at 
31.9 and 30.6 km, respectively (with uncertainties of 0.3 km). By 29 km, 
the asteroid had already fragmented into about 20 large boulders of 
masses ~10⁴ kg. Judging from the deceleration, the mass of the leading 
boulder (main body) was ~2 × 10⁴ kg. The boulders started to break 
up again at altitudes of 26-24 km (10-13 MPa). The main body remained 
intact and quite massive (~10⁴ kg) until an altitude of 22 km, before 
severe disintegration began at a pressure of 18 MPa. Only a 15-kg 
fragment remained from the leading main mass at 17 km. Fragment F1, the 
largest individual piece surviving the descent, separated from one of the 
more decelerated boulders (not the main body) at an altitude of about 
25 km, surviving a maximal dynamic pressure of 15 MPa at 20 km. 
Its trajectory deviated from the original direction of flight (Extended 
Data Fig. 3) by 1.3° ± 0.4°, implying that the lateral velocity gained at 
the break-up was 400 ± 130 m s⁻¹. This lateral velocity is almost an 
order of magnitude larger than aerodynamic theory” and laboratory 
experiments” predict. Nevertheless, it is similar to the behaviour of the 
Moravka bolide”’, suggesting that forces other than purely aerodynamic 
effects are also present during the fragmentation of bolides. Judging 
from its deceleration, the terminal mass of fragment F1 was 450 ± 50 kg 
and dark-flight computation predicts a landing point in Lake Chebarkul, 
within 300 m of the actual impact site (Extended Data Fig. 4). The predicted 

Figure 1 | Ground projection of the terminal part of the bolide trajectory 
and meteorite-strewn field. Main trajectory (thick red line) and trajectory of 
fragment F1 (thin orange line) as plotted on Google Earth. The marks denote 
altitudes in kilometres. The predicted impact positions of 11 observed 
fragments (F1-F4, F6, F7 and F11-F15) are shown. The impact positions of 
simulated small (non-observable) fragments are also given. Yellow dots denote 
fragments that separated at lower altitude (21-26 km), pink fragments originate 
in the main break-ups at 30-39 km and brown fragments are from 

break-ups above 40 km. The three dot sizes correspond to terminal masses of 
1-10 g, 10-100 g and >100g. The second largest fragment, F2, had an 
estimated mass of 30 kg based on its observed deceleration. The dynamics after 

impact points and masses of other observed fragments (Extended Data 
Fig. 5) are given in Fig. 1 and Extended Data Fig. 6. Judging from the 
light-curve shape, the total mass of surviving fragments >100 g was at 
most only several per cent of the original mass and probably much less. 
It is much less than the pancake model predicts (>50%) and less than 
the separated-fragments model predicts⁷ (>10%). 
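
The lateral velocity of fragment F1 reported in this paragraph follows from its 1.3° ± 0.4° deviation from the main trajectory. The sketch below assumes that the forward speed at separation equals the Table 1 speed at 25 km (18.0 km s⁻¹). This is an approximation, because F1 separated from one of the more decelerated boulders. The function name is hypothetical.

```python
import math


def lateral_velocity(speed_m_s, deviation_deg):
    """Transverse speed (m/s) implied by an angular deviation from the flight path."""
    return speed_m_s * math.tan(math.radians(deviation_deg))


SPEED_AT_BREAKUP = 18.0e3  # m/s, assumed from Table 1 at 25 km

v_lat = lateral_velocity(SPEED_AT_BREAKUP, 1.3)
v_err = lateral_velocity(SPEED_AT_BREAKUP, 0.4)  # propagates the 0.4 degree uncertainty
print(f"lateral velocity ~ {v_lat:.0f} +/- {v_err:.0f} m/s")
```

The result is close to the quoted 400 ± 130 m s⁻¹. A slightly lower forward speed for the parent boulder would bring the central value down to 400 m s⁻¹.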

The fragmentation history shows that the bulk strength of the 
Chelyabinsk asteroid was ~1 MPa, a value typical for smaller meteoroids, 
confirming that for near-Earth objects there is little correlation between 
strength and size on scales from centimetres to tens of metres⁶. Only the 
very strongest parts, representing <1% of the original mass, had strength 
>15 MPa, which may be comparable to the exceptionally strong Carancas 
meteoroid’** and typical tensile strengths of recovered stony meteorites®. 
Even in the Martian atmosphere, the body would probably separate 
into several large fragments before reaching the surface. It is therefore 
not surprising that most fresh craters on Mars are found in clusters”. 

A large dust trail was left in the atmosphere after the bolide passage. 
The southern and bottom sides of the trail were illuminated by the 
rising Sun. From northern sites the trail looked thin because only the 
illuminated bottom was visible, except for the first seconds, when the 
trail was self-luminous. The trail started at an altitude of 68 km. Between 
60 and 26 km, the trail was thick, with radius 1.0-1.8 km. A thinner 
trail continued to 21.5 km and a very thin part extended to 18 km. The 
total volume of the trail was of order 600 km³ and its air mass was 
~5 × 10⁹ kg. The lower edge of the thick trail was almost stationary 
immediately after the passage of the bolide but then moved forwards 
in two waves as the material originally released at higher heights arrived. 
The velocities of these shocks were 2.8 and 1.7 km s⁻¹, respectively 

luminous flight ceased’? (dark flight) was computed using the wind field from 
the nearest available radiosonde at Verkhnee Dubrovo (180 km north of 
Chelyabinsk) measured at 0:00 UT on 15 February 2013. Using the wind field 
measured by radiosonde at Kurgan (250 km east of Chelyabinsk) would shift 
the meteorites much more to the south (by 2.5 km for a 200-g meteorite). The 
position of the impact hole in Lake Chebarkul (marked ‘Crater’) and the 
centroid of strewn field of small meteorites’? are also shown. See also Extended 
Data Fig. 4. We expect that, as in other cases", the mass distribution will be 
more complicated and the strewn field will be larger than our idealized 
model predicts. 

(Extended Data Fig. 6). Larger regions of hot material within the trail 
continued moving forwards for some time after the bolide passage. 
The forward velocity of the brightest (and lowest) of several distinct 
hotspots (Extended Data Fig. 7) decreased from 0.7 to 0 km s⁻¹ during 
the period 1.5-4 s after the passage of the body. At the same time, a 
constant vertical velocity of 0.08 km s⁻¹ was measured for this hotspot 
(Extended Data Fig. 6). The upward motion, caused by buoyancy of 
the hot mixture of air, vapour and dust, continued when the forward 
motion stopped. About 15s after the bolide passage, splitting of the 
trail into two parallel tracks became apparent (Extended Data Fig. 8), 
an effect seen in high-altitude luminous meteor trails and caused by 
convective instability leading to vortices”®. After 50 s, the top of the cloud, 
which formed from the hotspot, was 3 km above the bottom of the trail 
(the trail as a whole moved about 0.5 km upwards). The maximum 
height of 6-7 km was reached after 3 min (Extended Data Fig. 9). 

On the basis of the fragmentation strength of the Chelyabinsk 
asteroid, we conclude that it was probably a fractured single stone and not 
a rubble-pile assemblage, whose expected strength would be only ~25 Pa 
(ref. 27). The unusually close orbital association with asteroid 86039 
suggests a possible genetic connection. A detailed comparison of 
reflectance spectra from the Chelyabinsk meteorite and 86039 is highly 
desirable to help explore this relationship. If the orbital association is 
real, we would also predict a very short exposure age for Chelyabinsk 
relative to other LL chondrites, because such orbital associations can 
only persist for a small fraction of the dynamical lifetime of a 
near-Earth asteroid. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 

Extended Data Figure 1 | Visibility and orbital evolution of Chelyabinsk 
asteroid in the past. The results of backward integration of Chelyabinsk 
nominal orbit (red) and its 1,000 clones (black dots). a, Apparent magnitude as 
seen from the Earth at 30-day intervals during past 10 years. Green, mean of all 
clones. Plotted only for elongations >45° from the Sun. b, Minimum orbit 
intersection distance (MOID) between the Chelyabinsk orbit and the 
osculating orbit of asteroid 86039 during the past 2,000 years. c, Change in 
velocity required to reach Chelyabinsk orbit from the orbit of 86039 at MOID 
during the past 2,000 years. 

Extended Data Figure 2 | Light curve of Chelyabinsk superbolide in relative 
units and fragmentation altitudes as determined from sonic booms. The 
luminous signal was computed in relative units from pixel sum values from 
substantial parts of the images, and then normalized to 100. Corrections to 
bolide range and atmospheric extinction were applied but no attempt to 
convert the signal to absolute units was made (for the absolute light curve, see 
ref. 5). For each video, the measured pixel sum was corrected using the 
estimated changes of automatic gain control of the camera. The absolute timing 
was obtained from the Nizhny Tagil video (L1) and the height scale from 
the Beloreck video (video 14, or L4). The fragmentation altitudes were 
determined from the timing of secondary sonic booms and numerical 
ray-tracing modelling of the sonic wave’s propagation from the bolide to the video 
sites. The videos used are listed in Extended Data Table 1. a, Bolide light curve 
as a function of time. b, The same data as a function of height compared 
with the computed source heights of sonic booms detected (as image failures) in 
the Mirnyi video (A19). The fragmentations are marked by vertical bars at the 
corresponding height. The length of the bar is proportional to the number 
of video frames affected by the failure. c, The compilation of sonic boom source 
heights from all 19 videos used for acoustic analysis. 

Extended Data Figure 3 | Deviation of fragment F1 from the main 
trajectory. Frame from video 15. The time is counted from 3:20:20 UT. The 
labelled marks identify points on the main trajectory at the given altitude 
(in kilometres). E represents the endpoint of the main trajectory. 

Extended Data Figure 4 | Predicted impact position of fragment F1, 
computed with four different wind fields, compared with the position of 
the hole in the ice (‘Crater’). The point marked F1 was computed with 
Verkhnee Dubrovo radiosonde data (0:00 UT). Point K is for Kurgan 
radiosonde (0:00 UT), point U is for the UKMO wind model for Chelyabinsk 
(12:00 UT) and point G is for the G2S model (3:00 UT) (ref. 31 in Supplementary 
Information). The distance between U and K is 960 m. The distance between F1 
and the crater is 220 m. We note that the position of the crater was not used 
for the computation of the F1 trajectory and impact point. The background 
image is from Google Earth and was taken one day after the impact. 

Extended Data Figure 5 | Identification of fragments in a series of images 
from video 7. Fragments F1-F7 originated at lower altitudes (~25 km), 
whereas fragments F11-F16 originated at higher altitudes (>30 km). 

Extended Data Figure 6 | Dynamics of the dust trail and fragments and 
predicted impact positions of observed fragments. a, Altitude as a function of 
time for the lower edge of the thick dust trail (TE) and hotspots within the trail 
(HS1-HS3). The hotspots are identified in Extended Data Fig. 7. b, Altitude 
as a function of time for the main body (M), lower fragments (F1-F7) and 
upper fragments (F11-F16), plotted together with the dust trail features. The 
fragments are identified in Extended Data Fig. 5. The main body and trail were 
measured primarily from video 2; and the fragments, from video 7. 
c, Upward motion of the main hotspot (HS1) within the dust trail. Vertical 
deviation of the centre of the hotspot from the trajectory is plotted against time. 
The linear fit gives upward velocity of 0.08 km s⁻¹. d, Predicted impact 
positions and dynamic properties of observed fragments. Ablation coefficients 
and terminal masses were obtained by fitting the observed decelerations. 
Masses are valid for assumed spherical shapes and bulk densities of 
3,300 kg m⁻³. In some cases, the ablation coefficient could not be computed 
because there was an insufficient number of data points. 

Extended Data Figure 7 | Images of the dust trail at early stages. a-c, Images 
from a single video site (video 2) located north of the fireball trajectory. Time is 
counted from 3:20:20 UT. Three distinct hotspots (HS1-HS3) are identified. 
The labelled marks identify points on the trajectory at the given altitude 

(in kilometres). The unlabelled marks above them identify points at the same 
geographic coordinates but 1 km higher. They are provided to assess the width 
of the trail. d, Image from video 14, from the southwest. It demonstrates 

that the width of the fully illuminated fresh trail was ~2 km over much of its 
length. For later evolution of the trail, see Extended Data Figs 8 and 9. 

Extended Data Figure 8 | Evolution of the lower part of the dust trail as seen 
from Chelyabinsk during the first minute. Three frames from video 6 
(the video has colour defects). The time is given in minutes and seconds and is 
counted from 3:20:20 UT. The lower marks identify points on the trajectory in 
1-km altitude intervals. The upper marks identify points at the same 
geographic coordinates but 1 km higher. The video demonstrates vertical 
ascent and splitting of the trail. When the original video is speeded up, rotation 
of the material in the trail is clearly visible. The trail was illuminated from 
below. The ‘bubble’ formed at the position of the main hotspot (HS1; see 
Extended Data Fig. 7) was in shadow most of the time. Only its illuminated top 
is visible on the third frame, just at the edge of the field of view. 

Extended Data Figure 9 | Longer-term evolution of the dust trail. 
Five frames from an uncalibrated video (http://www.youtube.com/watch?v=Z20lnOVscpc, 
author D. Beletsky) taken from south of the fireball 
trajectory (on the road from Magnitogorsk to Chelyabinsk). The time is given 
in minutes and seconds and is counted from 3:20:20 UT. The trail was fully 
illuminated from this site. The video demonstrates the rise of the ‘bubble’ 
formed at the position of the main hotspot (HS1; see Extended Data Fig. 7). The 
maximum altitude was reached about 3 min after the bolide had passed by. 

Extended Data Table 1 | List of YouTube videos used 

No.   City                 Longitude (°E)   Latitude (°N)   Altitude (m)   YouTube code*   Author 

Videos used for trajectory determination 
1     Verkhnyaya Pyshma    60.6082          56.9635         276            LzvipPOpUyO     Vos'moy Rayon 
2     Kamensk Ural'sky     61.9186          56.4151         170            kFlpCT3v12E     Aleksandr Ivanov 
3     Kamensk Ural'sky     61.9333          56.3933         151            7TPDwSXaiBO     LANCER96RUS 
4     Kamensk Ural'sky     61.9426          56.3852         163            J3DqsbxKOMA     LANCER96RUS 
5     Chelyabinsk          61.2967          55.2204         246            4ZxXYscmgRg     Andrey Borisovich Korolev 
6     Chelyabinsk          61.4720          55.1797         231            OM-5ngYg5Mg     Vyachesilav Kravchenko 
7     Chelyabinsk          61.3935          55.1756         228            8Eu7QAP2DPM     Viktor Borzov 
8     Chelyabinsk          61.4448          55.1663         231            32aJ4RB8Mal     MegaProfitroll 
9     Chelyabinsk          61.3637          55.1500         257            gQ6Pa5Pv_io     Dmitriy Volkov 
10    Korkino              61.3995          54.8909         241            odKjwrjIM-k     nek rozato 
11    Yemanzhelinsk        61.3040          54.7566         234            2Gc1HgOShNY     Aleksandr Zakharov 
12    Kichigino            61.2717          54.5008         238            OCoP7WB8Gew     Mikhail Troitsk 
13    Troitsk              61.5313          54.0771         183            UjNpJXP7trQ     C2Crash 
14    Beloreck             58.4102          53.9527         532            5_1ytDqps8A     MrKuzaman 
15    Chebarkul            60.4002          54.9950         343            xboo7LiNRO8     yulya Ryzhaya 

Videos used for light curve measurement 
L1    Nizhny Tagil         59.9439          57.8703         220            NcZNhJVW5xI†    SetiTagila 
L2    Tyumen’              65.6053          57.2024         60             1ZdYf2vM5LA     EastSide287 
L3    Kurgan               65.2956          55.4735         70             X2ja6_zJtzk     Dmitriy Grekov 
L4    Beloreck             58.4102          53.9527         532            5_1ytDqps8A     MrKuzaman 

Videos used for acoustic analysis 
A1    Chelyabinsk          61.5482          55.1647                        HaMurqKMenw     Alexandr Gubarev 
A2    Malinovka            61.2500          55.0900                        5Rh2-v-gFEs     Svladislav74 
A3    Pervomaysky          61.2008          54.8726                        R99zvecrqXo8    Axel Alex 
A4    Chelyabinsk          61.4082          55.2675                        hMZkv0-2500     NIK14174 
A5    Chelyabinsk          61.3479          55.1799                        5xMOSY4bW_M     Aleksej D 
A6    Chelyabinsk          61.3657          55.1570                        uXU3z3-bxNk     Andrey Yurkin 
A7    Chelyabinsk          61.3637          55.1500                        gQ6Pa5Pv_io     Dmitriy Volkov 
A8    Chelyabinsk          61.3509          55.1114                        Np_mpGYSBSA     Serg Kh 
A9    Chelyabinsk          61.4070          55.1650                        rfITN4XAt34     Maxim Savelyev 
A10   Chelyabinsk          61.4113          55.1582                        G2KpK_GmvA8     Yuriy Bazhaev 
A11   Chelyabinsk          61.4452          55.1570                        38ZQkZCWILO     Sergey Polyakov 
A12   Chelyabinsk          61.4703          55.1636                        zkVUbMdAV2Q     Alexandr Burlakov 
A13   Chelyabinsk          61.4639          55.1665                        QTZOvesq5fA     greshnikacw 
A14   Yemanzhelinsk        61.3280          54.7564                        KmjjGgvOIMY     Daniil Lysenko 
A15   Chelyabinsk          61.4780          55.1404                        16KxOxjSndA     dima stepanov 
A16   Kopeysk              61.6046          55.1210                        VZJEfNId_JU     Aleksandr Al'shevsky 
A17   Kopeysk              61.6042          55.1103                        4x3xG-eKpNA     ZaTDro 
A18   Kopeysk              61.6183          55.1087                        UsOEy5hCCow     Andrey Kostomarov 
A19   Mirnyi               60.312           54.577                         LmlwWE Dylx0    kaban0796 

* http://www.youtube.com/watch?v=code 
† The last of the four videos in the compilation was used 


©2013 Macmillan Publishers Limited. All rights reserved 


doi:10.1038/nature12741 


A 500-kiloton airburst over Chelyabinsk and an 
enhanced hazard from small impactors 


P. G. Brown, J. D. Assink, L. Astiz, R. Blaauw, M. B. Boslough, J. Borovička, N. Brachet, D. Brown, M. Campbell-Brown, 
L. Ceranna, W. Cooke, C. de Groot-Hedlin, D. P. Drob, W. Edwards, L. G. Evers, M. Garces, J. Gill, M. Hedlin, 
A. Kingery, G. Laske, A. Le Pichon, P. Mialle, D. E. Moser, A. Saffer, E. Silber, P. Smets, R. E. Spalding, P. Spurný, 

Most large (over a kilometre in diameter) near-Earth asteroids are 
now known, but recognition that airbursts (or fireballs 
resulting from nuclear-weapon-sized detonations of meteoroids in the 
atmosphere) have the potential to do greater damage¹ than 
previously thought has shifted an increasing portion of the residual 
impact risk (the risk of impact from an unknown object) to smaller 
objects². Above the threshold size of impactor at which the 
atmosphere absorbs sufficient energy to prevent a ground impact, most of 
the damage is thought to be caused by the airburst shock wave³, but 
owing to lack of observations this is uncertain⁴,⁵. Here we report an 
analysis of the damage from the airburst of an asteroid about 
19 metres (17 to 20 metres) in diameter southeast of Chelyabinsk, 
Russia, on 15 February 2013, estimated to have an energy equivalent 
of approximately 500 (±100) kilotons of trinitrotoluene (TNT, 
where 1 kiloton of TNT = 4.185 × 10¹² joules). We show that a widely 

Figure 1 | Light curve of the Chelyabinsk airburst. a, The brightness profile 
for the Chelyabinsk airburst, based on indirect illumination measured from 
video records. The brightness is an average derived from indirect scattered sky 
brightness from six videos proximal to the airburst, corrected for the sensor 
gamma setting, autogain, range and airmass extinction, following the 
procedure used for other airburst light curves generated from video”*”*. The 
light curve has been normalized using the US government sensor data peak 
brightness value of 2.7 × 10¹³ W sr⁻¹, corresponding to an absolute 
astronomical magnitude of −28 in the silicon bandpass. The individual video 
light curves deviate by less than one magnitude between times −2 and +1.5 
with larger deviations outside this interval. Time zero corresponds to 
03:20:32.2 UTC on 15 February 2013. b, The energy deposition per unit height 


referenced technique** of estimating airburst damage does not 
reproduce the observations, and that the mathematical relations’ 
based on the effects of nuclear weapons—almost always used with 
this technique—overestimate blast damage. This suggests that earl- 
ier damage estimates*® near the threshold impactor size are too 
high. We performed a global survey of airbursts of a kiloton or more 
(including Chelyabinsk), and find that the number of impactors 
with diameters of tens of metres may be an order of magnitude 
higher than estimates based on other techniques*’. This suggests 
a non-equilibrium (if the population were in a long-term collisional 
steady state the size-frequency distribution would either follow a 
single power law or there must be a size-dependent bias in other 
surveys) in the near-Earth asteroid population for objects 10 to 
50 metres in diameter, and shifts more of the residual impact risk 
to these sizes. 

for the Chelyabinsk airburst, based on video data. The conversion to absolute 
energy deposition per unit path length assumes a blackbody emission of 
6,000 K and bolometric efficiency of 17%, the same as the assumptions used 
to convert earlier US government sensor information to energy”®. The heights 
are computed using the calibrated trajectory’® and features of the light 
curves common to different video sites, resulting in a height accuracy of 
about 1 km. The total energy of the airburst found by integrating under the 
curve exceeds 470 kt. The half-energy-deposition height range is 33-27 km; 
these are the heights at which energy deposition falls below half the 

peak value of approximately 80 kt per kilometre of height, which is reached 
at an altitude near 29.5 km. 

The Chelyabinsk airburst¹⁰ was observed globally by multiple 
instruments (including infrasound, seismic, US government sensors 
and more than 400 video cameras) at ranges up to 700 km away. The 
resulting airblast (shock wave travelling through the air from an 
explosion) shattered thousands of windows in urban Chelyabinsk, with 
flying glass injuring many residents. 
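
As a rough cross-check of the energy scale, the sketch below converts the integrated radiated energy reported by US government sensors for this event (read here as 3.75 × 10¹⁴ J) into a total yield. It assumes the 17% bolometric efficiency quoted in the caption of Fig. 1; whether the published 530 kt figure was derived in exactly this way is an assumption. The function names are hypothetical.

```python
KT_TNT_J = 4.185e12   # joules per kiloton of TNT (defined in the text)
RADIATED_J = 3.75e14  # integrated radiated energy from US government sensors
EFFICIENCY = 0.17     # bolometric efficiency quoted in the Fig. 1 caption


def joules_to_kt(energy_j):
    """Convert an energy in joules to kilotons of TNT."""
    return energy_j / KT_TNT_J


def total_energy_kt(radiated_j, efficiency):
    """Total yield (kt) implied by a radiated energy and a luminous efficiency."""
    return joules_to_kt(radiated_j) / efficiency


radiated_kt = joules_to_kt(RADIATED_J)
total_kt = total_energy_kt(RADIATED_J, EFFICIENCY)
print(f"radiated ~ {radiated_kt:.0f} kt, total ~ {total_kt:.0f} kt")
```

The radiated energy corresponds to roughly 90 kt, and dividing by the assumed efficiency gives a total close to the approximately 530 kt estimate.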

Data from US government sensors timed the peak brightness to 
03:20:32.2 UTC (coordinated universal time) on 15 February 2013 with 
an integrated radiated energy of 3.75 × 10¹⁴ J and a peak brightness of 
2.7 × 10¹³ W sr⁻¹. These values correspond to an estimated energy 
equivalent of about 530 kt of TNT. The peak brightness was equivalent 
to an absolute astronomical magnitude of −28 (referenced to a range 
of 100 km) in the silicon bandpass, making the airburst appear 30 times 
brighter than the Sun to an observer directly under this point. The 
airburst’s light curve has been reconstructed by considering the 
measured light production from several video records (see Supplementary 
Information for details) as shown in Fig. 1. We note that point-like 
models⁴,⁵ of airburst energy deposition, which treat the impactor as a 
strengthless, liquid-like material, predict that the height range in which 
the energy deposition per unit path length falls to half its maximum 
value is less than 2 km for impacts as shallow (17° from the horizontal)¹⁰ 


[Figure 2 panel labels: Source height (km); Overpressure (kPa); 50.02 seconds]


Figure 2 | Observed and predicted shock characteristics for the Chelyabinsk 
airburst. a, Theoretical airblast overpressures using standard nuclear weapons 
relations’ and the cylindrical-line source blast theory!’ (which assumes the 
explosion occurs so swiftly that it is equivalent to a single instantaneous 
detonation of a long cylindrical line of explosive charge) that is appropriate to 
central Chelyabinsk. The nuclear relation curves (in black) assume a spherical 
point source at a specific height and show assumed yields of 500 kt (dashed line) 
and 1 Mt (dotted line). The cylindrical-line source airblast model (red line) uses 
the energy deposition per unit length from Fig. 1b to define an equivalent blast 
radius as the source and assumes that the shock is linear at the ground (linear 
means its amplitude is low enough to be well approximated as moving at 
the local ambient speed of sound and non-linear effects are negligible). 

b, Travel-time residuals between the time of airburst passage at each height and 
the main airblast arrival for 38 videos (see Supplementary Table 5). The 
residuals (black circles) show the observed arrival time (corrected for fireball 
Table 1 | Energy estimates for the Chelyabinsk airburst 

Technique                    Best estimate (kt)    Range (kt) 
Seismic                      430                   220-630 
Infrasound (mean period)     600                   350-990 
US government sensor         530                   450-640 
Video-derived lightcurve     >470 


Here ‘kiloton’ refers to the energy equivalent to a kiloton of TNT. To estimate the energy from infrasonic 
airwaves, all 42 infrasound stations of the International Monitoring System? were examined. Of these, 
20 stations showed clear signals from the airburst. Our infrasound energy estimates are based on the 
average observed dominant infrasound period from 12 stations that have stratospheric returns 
showing the highest signal-to-noise ratio. Seismic Rayleigh waves generated by the airburst 
shock wave impinging on the Earth’s surface just south of Chelyabinsk were detected by about 70 
seismic stations at ranges in excess of 4,000 km. The amplitude of these waves in specific 
passbands as calibrated to nuclear airbursts!® was used as an independent estimate of source 
energy. 


as that of Chelyabinsk, which is less than one-third of the observed 
value (more than 6 km). (We note that any object striking the Earth or 
its atmosphere is an impactor; a ground impactor creates a crater, but 
most burn up before that, releasing a large amount of energy into the 
atmosphere as an airburst.) Airburst energy estimates from four dif- 
ferent techniques are summarized in Table 1. Our preferred mean 
energy estimate is in the range of 400-600 kt. Details of the analysis 
motion) minus the expected time, calculated assuming propagation at the local 
adiabatic sound speed and incorporating winds'’. For comparison, the width of 
the visible cloud trail is shown (red line). This is consistent with the shock wave 
travelling faster than the ambient sound speed near the airburst. The minimum 
timing residuals suggest that shock source heights vary between 30 km and 
23 km across Chelyabinsk, as opposed to originating from a point source. 

c, Modelling the temperature of the trail (colour scale). The apparent cross-section 
of the CTH simulation 50 s after 0.5 Mt was released into segmented 
cylinders of air. The outer blue-grey shaded area reveals an envelope of shocked 
air—note that the dominant shape of the shock is cylindrical. d, For comparison 
with c, a video frame of the dust cloud taken 250 km to the southwest of the 
airburst path looking north is shown (http://www.youtube.com/watch?v=ICv9S0Z0e0E, 
taken by E. Volkov). Approximately 140 km of the end 
portion of the airburst trail is shown, some 40-60 s after the passage of the 
fireball. 
procedures and measurements are given in the Supplementary 
Information. 
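
As a rough consistency check on the sensor-based estimate, the radiated energy can be converted to a total yield. The sketch below assumes that the 17% bolometric efficiency quoted in the Fig. 1 caption also applies to the integrated radiated energy of 3.75 × 10^14 J, and uses the conventional 4.184 × 10^12 J per kiloton of TNT. The function name is illustrative only.

```python
# Sketch: convert an integrated radiated energy to a total yield in kt of TNT.
KT_TNT_J = 4.184e12  # joules per kiloton of TNT (standard convention)


def total_yield_kt(radiated_energy_j, luminous_efficiency):
    """Total source energy (kt TNT) implied by a radiated energy and an efficiency."""
    if not 0.0 < luminous_efficiency <= 1.0:
        raise ValueError("luminous_efficiency must be in (0, 1]")
    return radiated_energy_j / luminous_efficiency / KT_TNT_J


if __name__ == "__main__":
    # 3.75e14 J radiated; 17% efficiency is an assumption taken from the Fig. 1 caption
    print(total_yield_kt(3.75e14, 0.17))
```

Under these assumptions the result lies close to the quoted estimate of about 530 kt.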

To establish the nature and source height of the airblast that caused 
damage in Chelyabinsk, we used the known trajectory’® and a suite of 
videos (see Supplementary Table 5) that recorded both the airburst 
and the main airblast arrival. We computed acoustic travel times from 
each point on the airburst trajectory to each video location using a 
propagation model including winds that was developed for earlier 
airburst infrasound analyses'’. The results show that the first airblast 
wave (which also produced the damage) arrived from different alti- 
tudes at different sites, consistent with a cylindrical shock from the 
extended airburst, as opposed to a more point-like explosion. The 
timing residuals between the observed and expected arrivals follow 
the bolide trail size, as shown in Fig. 2b, consistent with the shock 
being strong early in its propagation. The airblast reaching the city of 
Chelyabinsk was generated at altitudes of 24-30 km, roughly from the 
peak to the end of the main airburst. 

In Fig. 2a we show overpressure predictions from standard airblast 
relations based on nuclear explosions’, as used by most impact-effect 
models**'*, For comparison, the predictions of cylindrical-line source 
airblast theory applied to meteor entry’’ are also shown. The airblast 
overpressure in Chelyabinsk from window breakage measurements is 
3.2 ± 0.6 kPa (see Supplementary Information for details). We note 
the overestimation of overpressure using the nuclear blast relations’, 
an effect others have suggested in connection with airbursts*. Given 
that nuclear explosions release half their energy as radiation’, thus 
reducing the effective yield of airblast energy, the nuclear curve in 
Fig. 2a that is most appropriate to Chelyabinsk is about 1 Mt. 

To examine whether a fragmentation model” is consistent with the 
observed data and estimated object size, we have applied an entry 
code based on a progressive fragmentation model of the initial object. 
Assuming an initial meteoroid of diameter 19 m and a tensile strength 
at first fragmentation of 0.7 MPa (ref. 10), with ablation ending at about 
27 km once most of the energy has been lost, we find a reasonable 
match to both the light curve and early dynamics. The final main 
fragmentations in this model occur near 4 MPa, very similar to those 
observed (1-5 MPa) in the most severe fragmentation portion of the 
airburst’®. The dynamics and light production from the model are not 
realistic near the terminal phase of the airburst because the model 
assumes that all fragments split identically at each fragmentation 
epoch. This is in contrast to observations at the end of the airburst 
where one leading fragment was observed” (as opposed to dozens of 
identically sized individuals). 

To further define the nature of the shock, we have used the well 
known CTH simulation framework used for the Tunguska’ airburst 
and impactors' comparable to the asteroid causing the Chelyabinsk 
airburst. The simulation used all the observed trajectory parameters’ 
and the observed energy as a function of height (Fig. 1b) to mimic the 
entry process by creating an instantaneous energy release in a sequence 
of momentum-preserving air cylinders along the airburst path, scaled 
such that the total integrated energy is 500 kt. Figure 2c and d shows 
the result of this simulation and comparison to a video record of the 
dust cloud generated by the airburst at a similar time. The notable 
characteristics are that the primary shock is cylindrical, in contrast to 
point-source energy release airburst models*®, which have a strong 
spherical shock component. Instabilities that result from fast-rising 
buoyant air in the simulation produce similar structures to those seen 
in videos of the dust cloud. 

Model overpressures for central Chelyabinsk are found to be 3 kPa, 
consistent with observations. Our estimates of overpressure are based 
on window breakage (see Supplementary Fig. 5) confined to a small 
region in Chelyabinsk. The CTH simulations were run for more than 
three minutes after the airburst, producing model variations of over- 
pressure across the entire city of Chelyabinsk which were smaller than 
the differences produced by local effects, such as shock reflections from 
buildings, numerical uncertainty in the simulation and our generally 
Figure 3 | The estimated cumulative flux of impactors at the Earth. The 
bolide impactor flux at the Earth (bolide flux 1994-2013; black circles) is based 
on about 20 years of global observations from US government sensors and 
infrasound airwave data. Global coverage averages 80% among a total of 58 
observed bolides with E > 1 kt and includes the Chelyabinsk bolide (rightmost 
black circle). This coverage correction is approximate and the bolide flux curve 
is probably a lower limit. The brown line represents an earlier power-law fit 
from a smaller data set for bolides 1-8 m in diameter’®. Error bars represent 
counting statistics only. For comparison, we plot de-biased estimates of the 
near-Earth-asteroid impact frequency based on all asteroid survey telescopic 
search data until mid-2012 (green squares)* and other earlier independently 
analysed telescopic data sets’ including the NEAT discoveries (pink squares) 
and the Spacewatch survey (blue squares), where diameters are determined 
assuming an albedo of 0.1. From the telescopically determined number of near- 
Earth asteroids and their typical orbits we can compute the average interval 
between Earth impactors of a given energy. Energy for telescopic data was 
computed assuming a mean bulk density of 3,000 kg m^-3 and average impact 
velocity of 20.3 km s^-1. The intrinsic impact frequency for these telescopic data 
was found using an average probability of impact for near-Earth asteroids of 
2 × 10^-9 per year for the entire population of asteroids. Lunar crater counts 
converted to equivalent impactor flux and assuming a geometric albedo of 
0.25 (grey solid line) are shown for comparison’, although we note that 
contamination by secondary craters and modern estimates of the near-Earth- 
asteroid population that suggest lower albedos will tend to shift this curve to the 
right and downwards. Finally, we show the estimated influx from global 
airwave measurements conducted from 1960 to 1974, which detected larger 
(5-20 m) bolide impactors (red triangles)'* using an improved method for 
energy estimation compared to earlier interpretations of the same data. 
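
The conversion from telescopic diameters to impact energies described in this caption can be sketched as follows. The example assumes a spherical impactor with the quoted bulk density (3,000 kg m^-3) and mean impact speed (20.3 km s^-1). The function name and the spherical-shape assumption are illustrative and not taken from the original analysis.

```python
import math

KT_TNT_J = 4.184e12  # joules per kiloton of TNT (standard convention)


def impact_energy_kt(diameter_m, density_kg_m3=3000.0, speed_m_s=20.3e3):
    """Kinetic energy (kt TNT) of a spherical impactor of the given diameter."""
    mass_kg = density_kg_m3 * math.pi / 6.0 * diameter_m ** 3  # sphere volume = pi d^3 / 6
    return 0.5 * mass_kg * speed_m_s ** 2 / KT_TNT_J


if __name__ == "__main__":
    # 19 m is the initial meteoroid diameter assumed in the fragmentation model
    print(impact_energy_kt(19.0))
```

With these defaults a 19-m object carries roughly 500 kt, and the energy scales with the cube of the diameter.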


small number statistics. This limits our ability to validate the simulated 
CTH overpressure spatially. 

Using our best estimate for the Chelyabinsk airburst energy, of 
about 500 kt, we have estimated the bolide flux at the Earth over the 
period from 1994 to mid-2013. This estimate is based on 20 years of 
total global coverage by the US government or infrasound sensors, 
more than doubling the earlier time coverage’*’’. All events with esti- 
mated yields in excess of 1 kt are included. Figure 3 shows that this 
bolide flux at small sizes (less than 5 m in diameter) is in agreement 
within uncertainties with telescopic® data and earlier infrasonic’® 
influx estimates. However, at larger diameters (15-30 m), both the 
bolide and infrasound’® flux curves show an apparent impact rate at 
the Earth an order of magnitude larger than either that estimated by 
telescopic surveys or the longer-term average impact rate provided by 
lunar cratering. In both cases these deviations well above the constant 
power-law slope of ref. 16 are due to single large events, so caution 
must be exercised owing to the small number statistics. A best-fit 
regression line to the bolide flux is given by N = aE^b, where N is 
the cumulative number of objects with energy E (in kilotons) or more 
that impact the Earth per year, a = 3.31 ± 0.11 and b = −0.68 ± 0.06. 
We note that excluding the rightmost two points in Fig. 3 (representing 
Chelyabinsk and two other events larger than 30 kt) produces a nearly 
identical slope. 
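
To illustrate how the regression is used, the sketch below evaluates the fit read as N = aE^b with the quoted central values a = 3.31 and b = −0.68, ignoring the stated uncertainties. The function names are illustrative.

```python
def cumulative_impacts_per_year(energy_kt, a=3.31, b=-0.68):
    """Cumulative number of impacts per year with energy >= energy_kt (power-law fit)."""
    if energy_kt <= 0:
        raise ValueError("energy_kt must be positive")
    return a * energy_kt ** b


def mean_interval_years(energy_kt, a=3.31, b=-0.68):
    """Mean time between impacts of at least energy_kt, in years."""
    return 1.0 / cumulative_impacts_per_year(energy_kt, a, b)


if __name__ == "__main__":
    for energy in (1.0, 10.0, 100.0, 500.0):
        print(energy, cumulative_impacts_per_year(energy), mean_interval_years(energy))
```

Evaluated at 500 kt, the fit implies a mean interval of roughly two decades, which is expected because the 20-year bolide record used for the fit contains Chelyabinsk itself.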

Using the telescopic impact frequency* (green squares in Fig. 3) as a 
baseline for the 20-year period of the bolide survey, there is only a 13% 
chance that any random 20-year period would have an airburst as large 
or larger than Chelyabinsk. The independent 14-year survey by infra- 
sound’® (1960-74) detected a probable ~1.5-Mt airburst on 3 August 
1963. Such a large event would be expected at the ~3% level during 
such a survey period. Although these deviations may be attributable to 
small number statistics, we note that Tunguska, with a source energy 
(energy released at the location of the explosion) of the order of 
3-15 Mt (refs 15,19; shown as a horizontal line in Fig. 3) is also an 
extreme outlier (expected at the 2-10% level to have occurred during 
the past century). These events, taken together with Chelyabinsk, are 
increasingly suggestive of non-equilibrium in the impactor flux for 
near-Earth asteroids that are 10-50 m in diameter. This is manifested 
as a change in the power-law energy-frequency distribution at these 
sizes, similar to changes in the power law at other sizes”’. This is also 
consistent with the recent origin of Chelyabinsk as a single near-Earth 
asteroid and a possible link to asteroid 86039 (ref. 10). Our findings 
support earlier interpretations of an influx maximum at this size 
range’'**. We note that telescopic surveys have only discovered about 
500 near-Earth asteroids that are 10-20 m in diameter® (comparable to 
the Chelyabinsk asteroid) of an estimated near-Earth asteroid population 
(http://ssd.jpl.nasa.gov) of around 2 × 10^7, implying that a 
non-equilibrium impactor population at these sizes could be present but 
not yet apparent in the discovered near-Earth asteroid population. 
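
The chance figures quoted above can be related to impact rates if impacts are treated as a Poisson process. This is an assumption made here for illustration; the text does not state how the 13% figure was computed. Under that assumption, the probability of at least one event in a window of T years at rate λ is 1 − exp(−λT).

```python
import math


def prob_at_least_one(rate_per_year, years):
    """Poisson probability of one or more events in the given time window."""
    return 1.0 - math.exp(-rate_per_year * years)


def rate_from_probability(probability, years):
    """Invert the relation: the event rate implied by a probability over a window."""
    if not 0.0 <= probability < 1.0:
        raise ValueError("probability must be in [0, 1)")
    return -math.log(1.0 - probability) / years


if __name__ == "__main__":
    rate = rate_from_probability(0.13, 20.0)  # 13% chance in a 20-year survey
    print(rate, 1.0 / rate)                   # implied rate and mean interval in years
```

On this reading, a 13% chance in 20 years corresponds to a mean interval of well over a century between Chelyabinsk-class events in the telescopic baseline.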
Single magnetic atoms, and assemblies of such atoms, on non-magnetic 
surfaces have recently attracted attention owing to their 
potential use in high-density magnetic data storage and as a plat- 
form for quantum computing’ *. A fundamental problem result- 
ing from their quantum mechanical nature is that the localized 
magnetic moments of these atoms are easily destabilized by inter- 
actions with electrons, nuclear spins and lattice vibrations of the 
substrate*°. Even when large magnetic fields are applied to stabil- 
ize the magnetic moment, the observed lifetimes remain rather 
short®® (less than a microsecond). Several routes for stabilizing 
the magnetic moment against fluctuations have been suggested, 
such as using thin insulating layers between the magnetic atom 
and the substrate to suppress the interactions with the substrate’s 
conduction electrons”**, or coupling several magnetic moments 
together to reduce their quantum mechanical fluctuations”*. Here 
we show that the magnetic moments of single holmium atoms on a 
highly conductive metallic substrate can reach lifetimes of the order 
of minutes. The necessary decoupling from the thermal bath of 
electrons, nuclear spins and lattice vibrations is achieved by a 
remarkable combination of several symmetries intrinsic to the sys- 
tem: time reversal symmetry, the internal symmetries of the total 
angular momentum and the point symmetry of the local envir- 
onment of the magnetic atom. 

In a free atom the net spin and orbital angular momenta couple to 
the total angular momentum J, resulting in an atomic magnetic 
moment. The continuous rotation symmetry means that the 2J + 1 
magnetic sub-levels J_z = −J, −J + 1, ..., +J are degenerate in the 
absence of a magnetic field B, as illustrated in Fig. 1a. If the atom is 
prepared in a specific magnetic state, this state is not stable against the 
slightest interaction of J with the environment, for example, with a 
thermal bath of electrons or nuclear spins, leading—without an energy 
cost—to transitions to other magnetic states and, hence, to a loss of 
orientation of the magnetic moment. Even the measurement of the 
magnetic state represents such an interaction. 

The magnetic moment of an atom in a memory cell or quantum 
device can be stabilized by placing the atom on a substrate where the 
electrostatic interaction with the neighbouring atoms gives rise to 
magnetic anisotropy. This crystal field splits the formerly degenerate 
magnetic states. A decade ago, a giant magnetic anisotropy was dis- 
covered for single Co atoms on the Pt(111) surface’. Because of the 
broken inversion symmetry, a magnetic moment oriented normal to 
the surface plane is preferred over an in-plane orientation (the normal 
orientation is 9.3 meV lower in energy). To lowest order, this uniaxial 
anisotropy can be described by a simple crystal-field Hamiltonian 
H = D J_z^2, where D is the uniaxial anisotropy constant. Figure 1b 
depicts the resulting states in the shape of a downturned parabola 
(D < 0) with the two degenerate ground states |+J⟩ and |−J⟩. 
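
The level scheme of the uniaxial Hamiltonian can be tabulated directly, because H = D J_z^2 is diagonal in the J_z basis. The sketch below uses an arbitrary illustrative value D = −1 (in arbitrary energy units) and an integer J; neither value is taken from the measurements.

```python
import numpy as np


def uniaxial_levels(J, D):
    """Return the J_z quantum numbers m and the energies E(m) = D * m**2 (integer J)."""
    m = np.arange(-J, J + 1)
    return m, D * m.astype(float) ** 2


if __name__ == "__main__":
    m, energy = uniaxial_levels(J=8, D=-1.0)  # D = -1 is a hypothetical value
    for mi, ei in zip(m, energy):
        print(mi, ei)
```

For D < 0 the two lowest levels are m = ±J, separated by a barrier of height |D| J^2 at m = 0.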

The main mechanism for magnetization reversal consists of spin-flip events 
mediated by the exchange interaction with substrate electrons. The 
interaction operator can be written as: 

$$V = \mathbf{J}\cdot\boldsymbol{\sigma} = J_z\sigma_z + \frac{1}{2}\left(J_+\sigma_- + J_-\sigma_+\right) \qquad (1)$$

where σ is the spin of the scattering electron and J_+ and J_- are the 
ladder operators within the J-multiplet (σ_+ and σ_- are the analogous 
ladder operators for the spin)**"°. A spin-flip scattering event changes 


Figure 1 | Magnetic sublevels of 
atoms with a total angular 
momentum J. a, Degenerate 
sublevels of a free atom. b, Sublevels 
split by uniaxial anisotropy. c, The 
states mix in the presence of a crystal-field 
Hamiltonian of C_3v symmetry. 
The degenerate green states split into 
symmetric (s) and antisymmetric (a) 
states. The inset shows the f.c.c. and 
h.c.p. adsorption sites. 
J_z by ±ħ. From this, one could conjecture that a complete reversal over 
the full barrier is strongly suppressed because it requires at least 2J 
scattering events and the energy barrier is large compared to thermal 
energies at cryogenic temperatures (for example, 0.34 meV at 4 K). 
This raised the hope of storing a magnetic bit in the two ground states. 
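
The thermal energy scale quoted above follows from k_B T. The sketch below also evaluates a Boltzmann factor for the 9.3 meV anisotropy energy of Co on Pt(111), treated here as the barrier height. That identification is a simplifying assumption for illustration.

```python
import math

K_B_MEV_PER_K = 8.617333262e-2  # Boltzmann constant in meV per kelvin


def thermal_energy_mev(temperature_k):
    """Thermal energy k_B * T in meV."""
    return K_B_MEV_PER_K * temperature_k


def boltzmann_factor(barrier_mev, temperature_k):
    """exp(-E / k_B T): relative weight of thermal activation over a barrier E."""
    return math.exp(-barrier_mev / thermal_energy_mev(temperature_k))


if __name__ == "__main__":
    print(thermal_energy_mev(4.0))     # thermal energy at 4 K
    print(boltzmann_factor(9.3, 4.0))  # activation over an assumed 9.3 meV barrier
```

The resulting factor is many orders of magnitude below unity, which is why purely thermal reversal over the barrier would be negligible at these temperatures.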

However, in spite of the high magnetic anisotropy, the actual stability 
of the ground state is very limited owing to the strong hybridization of 
the 3d states of the transition atom with the substrate electrons. This 
leads to lifetimes of the excited states of the order of τ ~ 10 fs (refs 4 
and 6) and to an energy smearing of the excited states of the order of the 
barrier height due to the Heisenberg uncertainty principle. As a result, 
a reversal is possible without the need for thermal energy. 
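
The size of this energy smearing can be estimated from the lifetime with the order-of-magnitude relation ΔE ≈ ħ/τ. The prefactor is a convention chosen here (some treatments use ħ/2τ) and is not specified in the text.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV s


def lifetime_broadening_mev(lifetime_s):
    """Order-of-magnitude energy width hbar / tau, returned in meV."""
    if lifetime_s <= 0:
        raise ValueError("lifetime_s must be positive")
    return HBAR_EV_S / lifetime_s * 1e3


if __name__ == "__main__":
    print(lifetime_broadening_mev(10e-15))  # tau = 10 fs
```

For τ = 10 fs this gives a width of tens of meV, which is not small compared with a barrier of a few meV, in line with the argument above.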

Longer lifetimes were achieved with atoms deposited on thin insulat- 
ing films” where the interaction with the substrate electrons is weakened. 
In this case, a close inspection of the magnetic transitions induced by 
the tunnelling electrons of a scanning tunnelling microscope (STM) 
showed more transitions’ than expected from the simple uniaxial 
Hamiltonian. In fact, the magnetic anisotropy of an adsorbed atom 
is hardly strictly uniaxial. Depending on the point symmetry of the 
adsorption site, additional terms enter the crystal-field Hamiltonian 71. 
In general, the crystal-field Hamiltonian can be written as: 


$$H = \sum_{n=0}^{\infty}\left(\sum_{m=0}^{n} B_n^m O_n^m + \sum_{m=1}^{n} \tilde{B}_n^m \tilde{O}_n^m\right) \qquad (2)$$

where $B_n^m$ and $\tilde{B}_n^m$ are the anisotropy constants and $O_n^m$ and $\tilde{O}_n^m$ are the 
crystal-field or Stevens operators. The latter are polynomials of order n 
in the operators J_z, J_+ and J_- with order m in the ladder operators"! and 
induce a zero-field splitting in the multiplet. Owing to time-reversal 
symmetry, states with opposite magnetic moment have the same 
energy for B = 0. Thus, all terms with odd n are forbidden. All terms 
with m = 0 only contain powers of J_z and thus do not mix the J_z 
eigenstates; for example, $O_2^0$ is essentially the already-introduced operator 
of the uniaxial anisotropy. All terms with n > 2J vanish, because 
the operators act within the 2J + 1 states of the multiplet. Likewise, 
terms with n > 2l vanish, where l is the orbital angular momentum of 
the individual electrons in the incomplete shell of the atom. Finally, all 
operators vanish that are incompatible with the point symmetry of the 
adsorption site; for example, all $\tilde{B}_n^m$ vanish for a mirror plane normal to 
the surface. 

In the following, we concentrate on atoms adsorbed in three-fold 
hollow sites of the (111) surface of face-centred cubic (f.c.c.) crystals. 
Both the f.c.c. and the hexagonal close-packed (h.c.p.) adsorption sites 
have a three-fold symmetry with a vertical mirror plane (and its rotations) 
perpendicular to the surface (see inset of Fig. 1c), that is, a C_3v 
symmetry. Encouraged by a previous study’’, we used rare earth atoms 
instead of transition metals, because the magnetic anisotropies of the 
former are typically larger than those of 3d atoms, and their 4f orbitals 
hybridize less with the substrate, resulting in potentially longer lifetimes. 
Thus, for rare earth atoms (with l = 3) in zero magnetic field the 
Hamiltonian can be written as”: 

$$H = B_2^0 O_2^0 + B_4^0 O_4^0 + B_6^0 O_6^0 + B_4^3 O_4^3 + B_6^3 O_6^3 + B_6^6 O_6^6 \qquad (3)$$

The operators $O_n^m$ are given explicitly in the Supplementary Information. 
Although the first three terms do not mix the J_z eigenstates, the last 
three mix J_z states differing in J_z by 3 or 6. 

The energy spectrum of the Hamiltonian (equation (3)) for a holmium 
(Ho) impurity (J = 8, with ten 4f electrons’*) and dominating, negative 
$B_2^0$ is illustrated in Fig. 1c. Overall, the parabolic energy dependence is 
preserved when the states are plotted according to their expectation 
value of J_z. Every eigenstate is a mixture of several |J_z⟩ states differing by 
ΔJ_z = ±3 and thus three classes of eigenstates form, marked in Fig. 1c 
by three different colours. Of the states marked in green, the |±3⟩ and 
|±6⟩ states are pairwise degenerate when considering only the first 
three operators in equation (3). They are, however, strongly mixed 
by the last three operators, resulting in new non-magnetic, energy-split 
states of different symmetry (‘s’ for symmetric and ‘a’ for antisymmetric). 
The red and the blue states form degenerate doublets. 

In general, depending on J, three cases can arise. First, the two 
ground states may be of the same colour and get strongly mixed and 
split. This scenario is often called the tunnelling case. The eigenstates 
are not pure in J_z, so the preparation of a pure state leads to tunnelling 
to a state of reversed magnetization by the Stevens operators alone, and 
consequently to short relaxation times”. Second, the two ground states 
Ψ_-J and Ψ_+J do not mix, but are linked by spin-flip transitions due to 
V (see equation (1)). This is the so-called Kondo case where scattering 
with a single electron can lead to transitions between the two ground 
states with zero energy cost. Again, lifetimes are short. At low tem- 
peratures the conduction electrons can even form a Kondo singlet with 
the atom. Third, the transition between the two ground states by 
scattering with a single electron is forbidden (that is, the matrix ele- 
ment of V vanishes). Thus, the magnetic moment of the atom is 
decoupled from the electrons of the substrate to first order, and 
reversal is possible only by sequential scattering via intermediate states 
of higher energy. Depending on the excitation energy, these processes 
can be suppressed at low temperatures, effectively restoring the barrier 
and leading to long lifetimes. 

The last scenario is realized for Ho on Pt(111), that is, 
⟨Ψ_-8|V|Ψ_+8⟩ = 0. This is a consequence of the combination of the 
symmetries and the value of J and follows from group-theoretical 
arguments. To show this, we investigate how the matrix element 
⟨Ψ_-8|J_i|Ψ_+8⟩, where i ∈ {x, y, z}, behaves under time reversal. For 
B = 0, H commutes with the time-reversal operator T, and 
T J_i = −J_i T. Further, T^2 = 1 because the number of electrons in Ho 
is even, T|Ψ_±8⟩ = |Ψ_∓8⟩ (see Supplementary Information) and T is 
anti-unitary, that is, ⟨χ|φ⟩ = ⟨Tφ|Tχ⟩ = ⟨Tχ|Tφ⟩*, where the asterisk 
indicates the complex conjugate. Thus we can write: 

$$\begin{aligned}
\langle\Psi_{-8}|J_i|\Psi_{+8}\rangle &= \langle T\Psi_{-8}|T J_i \Psi_{+8}\rangle^* \\
&= -\langle T\Psi_{-8}|J_i T\Psi_{+8}\rangle^* = -\langle\Psi_{+8}|J_i|\Psi_{-8}\rangle^* \\
&= -\langle\Psi_{-8}|J_i|\Psi_{+8}\rangle, \quad i \in \{x, y, z\}
\end{aligned} \qquad (4)$$



Thus, the matrix elements of J vanish as a consequence of symmetries 
and direct transitions between the ground states cannot be induced by 
scattering electrons. We expect long lifetimes at low temperatures. 
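
The symmetry argument can be checked numerically on a reduced model. The sketch below builds a C_3v crystal-field Hamiltonian for J = 8 from three Stevens operators in their standard textbook forms: O_2^0 = 3J_z^2 − J(J+1), O_4^3 = ¼{J_z, J_+^3 + J_-^3} and O_6^6 = ½(J_+^6 + J_-^6). The anisotropy constants are hypothetical values chosen only so that the uniaxial term dominates; they are not the constants determined in this work, and the remaining three terms of equation (3) are omitted for brevity. Because the Hamiltonian couples only states with ΔJ_z = ±3 or ±6, each ground state is obtained by diagonalizing its own J_z class. This assumes that +J and −J fall in different classes, as they do for J = 8.

```python
import numpy as np


def spin_matrices(J):
    """Return (m, Jz, J_plus, J_minus) in the |J, m> basis ordered m = -J ... +J."""
    m = np.arange(-J, J + 1)  # integer J assumed
    Jz = np.diag(m.astype(float))
    # <m+1| J_+ |m> = sqrt(J(J+1) - m(m+1)) sits on the first subdiagonal
    coeff = np.sqrt(J * (J + 1) - m[:-1] * (m[:-1] + 1.0))
    J_plus = np.diag(coeff, k=-1)
    return m, Jz, J_plus, J_plus.T


def c3v_hamiltonian(J, B20, B43, B66):
    """Reduced C3v crystal-field Hamiltonian built from standard Stevens operators."""
    m, Jz, Jp, Jm = spin_matrices(J)
    power = np.linalg.matrix_power
    O20 = 3.0 * Jz @ Jz - J * (J + 1) * np.eye(len(m))
    ladder3 = power(Jp, 3) + power(Jm, 3)
    O43 = 0.25 * (Jz @ ladder3 + ladder3 @ Jz)
    O66 = 0.5 * (power(Jp, 6) + power(Jm, 6))
    return B20 * O20 + B43 * O43 + B66 * O66


def ground_state_in_class(H, m, m_ref):
    """Lowest eigenvector of H restricted to J_z values congruent to m_ref modulo 3."""
    idx = np.where((m - m_ref) % 3 == 0)[0]
    _, vecs = np.linalg.eigh(H[np.ix_(idx, idx)])
    psi = np.zeros(len(m))
    psi[idx] = vecs[:, 0]
    return psi


def ground_state_matrix_elements(J=8, B20=-1.0, B43=2e-3, B66=1e-5):
    """|<Psi_-J| J_i |Psi_+J>| for i = x, y, z; the B values are hypothetical."""
    m, Jz, Jp, Jm = spin_matrices(J)
    H = c3v_hamiltonian(J, B20, B43, B66)
    psi_plus = ground_state_in_class(H, m, +J)
    psi_minus = ground_state_in_class(H, m, -J)
    Jx = 0.5 * (Jp + Jm)
    Jy = -0.5j * (Jp - Jm)
    return [abs(psi_minus @ op @ psi_plus) for op in (Jx, Jy, Jz)]


if __name__ == "__main__":
    print(ground_state_matrix_elements())
```

In this reduced model all three matrix elements vanish to numerical precision even though the ground states are no longer pure J_z states, which is the behaviour equation (4) predicts.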

In the experiments, Ho atoms were deposited on a clean Pt(111) 
substrate at 4.4 K, leading to individual Ho atoms on the surface (ada- 
toms) as seen by low-temperature STM (compare Fig. 2a). First, a non- 
magnetic tungsten tip was used to image the surface and investigate 
excitations of the atom. The second derivative of the tunnelling current 
I was measured as a function of the bias voltage V, while positioning the 
tip over the Ho atoms, as shown in Fig. 2b. The d^2I/dV^2 spectra taken at 
4.4 K reveal the typical dip—peak structure at a bias voltage V.,, as seen 
in many other single-atom systems**'*. Peak and dip are well sepa- 
rated, speaking for an inelastic excitation as opposed to the Kondo 
effect. As shown later by spin-polarized STM measurements, this 
excitation is magnetic and corresponds to a spin-flip transition from 
the ground state to the first excited state. The analysis of the spectra 
of many atoms reveals that the excitation energy of atoms in f.c.c. 
sites (about 8 meV) is higher than in h.c.p. sites (about 6 meV) and 
that the adsorption energy of f.c.c. sites is higher (see Supplementary 
Information). 

The anisotropy constants can also be obtained from first principles, 
by applying a first-order perturbation theory’* (see the Supplementary 
Information for more details). These calculations confirm that Ho 
on Pt(111) is indeed in a J = 8 state and shows an out-of-plane easy 
axis. For f.c.c. Ho atoms, the first excited magnetic state is 7.7 meV 
above the ground state, in good agreement with the experimental 
observation. 
[Figure 2 panel axis labels: dI/dV (arbitrary units); Probability]

Figure 2 | Magnetic behaviour of Ho atoms (J = 8) adsorbed on Pt(111). 
a, Three-dimensional topographic STM image of single Ho atoms adsorbed 
on Pt(111) at 4.4 K. b, Inelastic tunnelling spectrum (d^2I/dV^2) recorded 
on top of a single Ho atom in the f.c.c. position (red), showing an 
inelastic excitation at about 8 meV, as determined by a fit to the data 
(black). c, Time traces of the dI/dV signal recorded with a spin-polarized 
tip on a Ho atom in the f.c.c. position and on bare Pt (V = 5 mV, 
I = 1 nA). At t = 0 a voltage pulse V_pulse above the inelastic excitation 
threshold was applied. Although on bare Pt(111) no changes of the dI/dV 
signal were observed, the pulse induced a magnetic transition between the 
two ground states of Ho, as deduced from the observed change of the 
spin-polarized dI/dV signal. After some delay, the atoms spontaneously 
switch back to the initial state, as seen by a second change of the signal 
(indicated by arrow). d, A series of switching measurements of Ho atoms 
representing a wide range of 

function of the pulse height as well as a fit (black line) to the data 
obtained from a master equation for the induced transitions. f, Decay of 40 
states prepared by pulses (dots) and an exponential fit (black line). 
In the following we address atoms on the high-anisotropy f.c.c. site 
only. The large magnetic excitation energy would lead to a stable 
magnetic moment at low temperatures, because we expect from theory 
that scattering with the electron bath cannot directly switch the magnetization 
of the atom between the two ground states. Only if a scattering 
electron (from the tunnelling current, for example) has sufficient 
energy can it bring the atom from the ground state to the first excited 
state, which in turn can relax to either of the two ground states via a 
second inelastic scattering event with substrate electrons. 

To investigate this scenario, we carried out spin-polarized STM 
measurements with anti-ferromagnetic tips that have a spin polariza- 
tion perpendicular to the surface’®. We stabilized the tip over a Ho 
atom at I = 1 nA and V = 5 mV, which is below the excitation threshold. 
Thus, the tunnelling electrons cannot switch the atom but probe 
the spin polarization on top of the atom. The dI/dV signal depends on 
the relative orientation of the tip magnetic moment and the Ho 
moment. Then we applied 0.2-ms voltage pulses, while the feedback 
was turned off. If the atom switches between the two ground states 
during the pulse, the dI/dV signal is different before and after the pulse. 
We varied the height of the pulse up to 65 mV. 

Figure 2c shows examples of recorded time traces of the dI/dV signal 
at 1.1 K in cases where we observed a switching and, for comparison, 
on bare Pt(111). The spin signal recorded above the atom has two 
distinct values representing the two magnetic ground states with the 
magnetization perpendicular to the surface, whereas the signal on Pt(111) 
is not influenced by the pulse, thus excluding tip changes. After switching 
the magnetic moment of the Ho atom, it stays in the reversed state for 
times in the range from seconds to minutes until it spontaneously 
switches back, as indicated by the arrows in Fig. 2d. This illustrates 
long lifetimes, as will be discussed below. 

By applying many pulses to an individual atom, we determined the 
probability that a voltage pulse leads to a reversal of the magnetic state 
(see Fig. 2e). In agreement with the inelastic spectrum, we observe a 
threshold for reversal at about 8 mV. Above this, the switching prob- 
ability per pulse increases continuously owing to the increase of the 
tunnelling current with pulse voltage, because the feedback loop is 
opened during the pulse. The switching probability depends on the 
branching ratio of the relaxation from the excited states to either of the 
ground states, so the switching probability contains information about 
the anisotropy constants. The solid line in Fig. 2e was calculated using a 
master equation of the dynamics of the states’ population with aniso- 
tropy constants determined from first principles and fitting the coup- 
ling strength of the Ho spin to the substrate and tunnelling electrons 
(for details see the Supplementary Information). The switching prob- 
abilities for higher-voltage pulses depend strongly on the tip spin 
polarization, which can be determined from the fit to be about 20%. 
The curve fits the experimental data excellently, confirming the quantum 
mechanical nature of the Ho spin. 

Figure 3 | Lifetimes of adsorbed Ho atoms as a function of external
parameters. a, Spontaneous transitions between the two ground states for
different magnetic fields along the surface normal observed with spin-polarized
dI/dV signal recorded at 0.7 K, 5 mV and 50 nA. b, Spontaneous transitions
between the two ground states for two different temperatures without applied
magnetic field recorded at 5 mV and 50 nA. c, Spin-polarized dI/dV map of Ho
atoms and Co islands on Pt(111) (V = 300 mV, I = 1.5 nA). Two distinct
signals are observed on Co islands magnetized parallel (P) and antiparallel (AP)
to the tip. Positions of Pt atomic steps are indicated by dashed lines. d, Lifetimes
extracted from spontaneous transitions at 0.7 K without applied magnetic field
and 1 nA as a function of sample bias on the marked Ho atom in c. Error bars
represent the standard deviation of a fit to an exponential decay (28, 23, 102 and
94 transitions for 3.0-mV, 4.0-mV, 4.5-mV and 5.0-mV sample biases,
respectively).

To investigate the stability of the ground states, we repeated the
switching experiment 40 times and plotted the observed population
decay (see Fig. 2f). The data follow a simple exponential decay with a
lifetime τ of 381 ± 49 s, indicating an efficient decoupling of the Ho
spin from the conduction electrons.
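
A lifetime of this kind can be extracted from a set of measured dwell times. The sketch below is an assumption-laden illustration, not the analysis used here: instead of fitting the population decay curve, it uses the maximum-likelihood estimate for an exponential distribution (the mean dwell time) with an approximate standard error of τ/√N. The dwell times are synthetic, and the function names are hypothetical.

```python
import math
import random


def estimate_lifetime(dwell_times):
    """Return (tau, standard_error) for exponentially distributed dwell times.

    For an exponential decay the maximum-likelihood lifetime is the mean
    dwell time; its standard error is approximately tau / sqrt(N).
    """
    n = len(dwell_times)
    if n == 0:
        raise ValueError("need at least one dwell time")
    tau = sum(dwell_times) / n
    return tau, tau / math.sqrt(n)


def surviving_fraction(dwell_times, t):
    """Fraction of runs still in the reversed state after time t."""
    return sum(1 for d in dwell_times if d > t) / len(dwell_times)


# Synthetic data: 40 dwell times drawn with an assumed lifetime of 381 s.
random.seed(0)
synthetic = [random.expovariate(1.0 / 381.0) for _ in range(40)]
tau, err = estimate_lifetime(synthetic)
print(f"estimated lifetime: {tau:.0f} s (standard error {err:.0f} s)")
print(f"fraction surviving after 381 s: {surviving_fraction(synthetic, 381.0):.2f}")
```

Plotting `surviving_fraction` against time gives the population decay curve, which should fall on exp(−t/τ) if a single lifetime describes the data.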

To understand the value of τ, we turn back to the original explanation:
direct transitions between the ground states by tunnelling and
single spin-flip events are forbidden by time-reversal symmetry.
Thus, transitions between the ground states can only be achieved
by excitation to the first (or higher) excited state by substrate or tip
electrons in the high-energy tail of the thermal distribution or by
breaking time-reversal symmetry using a magnetic field. To verify this,
we investigated the dependence of τ on bias voltage, temperature and
magnetic field.

Figure 3a illustrates the effect of a magnetic field applied perpendicularly
to the surface at a temperature of 0.7 K. Similarly to the previous
experiments, the spin-polarized dI/dV signal on top of a Ho
atom was recorded as a function of time. To increase the signal-to-noise
ratio, a larger tunnelling current of 50 nA was used while keeping
V = 5 mV. In this experiment, we did not apply pulses but observed
spontaneous transitions between the two ground states. The switching
frequency clearly increases with increasing magnetic field
because spin-flip scattering between the two ground states is
no longer forbidden (for details see the Supplementary Information).
Similarly, τ decreases with increasing temperature, as illustrated in
Fig. 3b (V = 5 mV, I = 50 nA).

To study the bias dependence, a different sample was used. Figure 3c
shows a spin-polarized image (dI/dV map) of adsorbed Ho atoms. To
verify the spin polarization of the tip, Co islands were deposited that
decorate the step edges of the Pt(111) substrate. The Co islands are
magnetized out-of-plane, and the contrast between the two islands
indicates that the tip is sensitive to the out-of-plane magnetization.
Figure 3d shows τ as a function of V (I = 1 nA, T = 0.7 K), obtained by
analysis of spontaneous transitions. Even though the bias voltages are
below the first excitation energy, τ drops exponentially with rising
voltage. This is due to the exponentially increasing number of thermal
tunnelling electrons at higher bias voltages that have enough energy to
excite the atom to the first excited state. Owing to the close proximity
to the Co island, the Ho atom experiences a non-vanishing magnetic
stray field of the island. This leads to a shorter τ at 5 mV than in the first
measurement in spite of the lower temperature. However, at a 3 mV
bias voltage, the lifetime is much longer, τ = 729 ± 12 s.

An interesting side effect of the forbidden spin-flip scattering by
conduction electrons is that magnetic interactions between Ho atoms
mediated by the Ruderman–Kittel–Kasuya–Yosida (RKKY) interaction
do not cause the exchange of angular momentum to lowest
order, such that the observed τ is nearly independent of the local
arrangement of the Ho atoms, as discussed in the Supplementary
Information.

In conclusion, general symmetry requirements open up a new path- 
way to stabilize the magnetic moment of adsorbed atoms and decouple 
it from conduction electrons and nuclear spins. This method could 
potentially be used in data storage and quantum computing. By apply- 
ing a magnetic field, the interactions could be switched back on, thus 
allowing controlled quantum manipulation of the spins. 


METHODS SUMMARY 


The Pt(111) sample was cleaned by argon-ion sputtering followed by annealing to 
850 K. The sample surfaces were checked for impurities by STM at 4.4 K before 
deposition of Ho. Ho atoms were deposited onto the clean surfaces at 4.4 K directly 
in the STM. Unpolarized STM tips were prepared from tungsten wires and were 
cleaned in situ by flashing to above 2,500 K. Spin-polarized tips were prepared by 
depositing about 20 monolayers of Mn or Cr in situ onto the tip apex followed 
by gentle annealing. Inelastic d²I/dV² spectra were recorded using a modulation
frequency of about 16 kHz at a root-mean-square voltage of 2.4 mV and detecting
the second harmonic with a lock-in technique. Spin-polarized STM measurements
were performed by recording the dI/dV signal with coated tips using a modulation
of 720 Hz and 1 mV root-mean-square voltage. Voltage pulses were given
with the feedback loop open to avoid destabilizing the tip position. Tip changes
due to voltage pulses were excluded by recording, between every measurement on
Ho atoms, the time trace of pulses given on bare Pt(111). For tests of the
out-of-plane sensitivity of the spin-polarized tips, one-monolayer-high Co islands were
grown on Pt(111) at 360 K before inserting the sample into the STM. All experiments
were performed using a home-built cryogenic Joule–Thomson-cooled STM.
First-principles calculations were carried out within density functional theory.
The crystalline structure was determined with the Vienna Ab-initio Simulation
Package (VASP), which is well known for providing precise total energies and
forces. The obtained structural information was further used for calculations of
the electronic and magnetic properties of single Ho atoms on the Pt(111) surface
using a self-consistent relativistic full-charge Green’s function method, which is
specially designed for semi-infinite systems and embedded real-space clusters.
A self-interaction correction method and a local density approximation (LDA)
including Hubbard U corrections were applied to provide an
adequate description of strongly localized Ho f electrons.

Guided hierarchical co-assembly of soft patchy nanoparticles

André H. Gröschel, Andreas Walther, Tina I. Löbling, Felix H. Schacher, Holger Schmalz & Axel H. E. Müller


The concept of hierarchical bottom-up structuring commonly 
encountered in natural materials provides inspiration for the design 
of complex artificial materials with advanced functionalities.
Natural processes have achieved the orchestration of multicomponent
systems across many length scales with very high precision,
but man-made self-assemblies still face obstacles in realizing
well-defined hierarchical structures. In particle-based self-assembly,
the challenge is to program symmetries and periodicities of super- 
structures by providing monodisperse building blocks with suitable 
shape anisotropy or anisotropic interaction patterns (‘patches’). 
Irregularities in particle architecture are intolerable because they 
generate defects that amplify throughout the hierarchical levels. For 
patchy microscopic hard colloids, this challenge has been approached 
by using top-down methods (such as metal shading or microcontact 
printing), enabling molecule-like directionality during aggregation.
However, both top-down procedures and particulate systems based 
on molecular assembly struggle to fabricate patchy particles con- 
trollably in the desired size regime (10-100 nm). Here we introduce 
the co-assembly of dynamic patchy nanoparticles—that is, soft patchy 
nanoparticles that are intrinsically self-assembled and monodisperse— 
as a modular approach for producing well-ordered binary and tern- 
ary supracolloidal hierarchical assemblies. We bridge up to three 
hierarchical levels by guiding triblock terpolymers (length scale 
~10 nm) to form soft patchy nanoparticles (20-50 nm) of different 
symmetries that, in combination, co-assemble into substructured, 
compartmentalized materials (>10 µm) with predictable and tunable
nanoscale periodicities. We establish how molecular control
over polymer composition programs the building block symmetries 
and regulates particle positioning, offering a route to well-ordered 
mixed mesostructures of high complexity. 

Until now, research on block copolymer self-assembly in solution 
focused mostly on multicompartment micelles or the crystallization- 
driven formation of compartmentalized structures bridging one hierarchical
level. Only a few studies have considered these nanoscale
superstructures to be soft colloidal building blocks (CBBs) that can be
self-assembled on higher levels. The central challenge remains to
devise patchy CBBs with sufficient precision qualifying as supraparticular
tectons; that is, monodisperse in size and modified with defined
repulsive and attractive surface patches providing directional interaction
patterns. Equally problematic while operating on the nanoscale
are controlled particle positioning (interparticle forces may exceed
particle size), the general lack of target-oriented and predictable self- 
assembly protocols, and convincing visualization of nano-separated mul- 
tiphase organic materials. Responsive multiblock copolymers (1-10 nm) 
are potential candidates for the versatile bottom-up design (geometry, 
patchiness and dimension) of surface-compartmentalized nanoparti- 
cles (20-50 nm) with molecular precision (Fig. 1a). The particles them- 
selves then transmit the information needed for defined higher-level 
co-assembly into mesoscale structures (0.1–10 µm) of controllable size
and periodicity. In contrast with ‘static’ patchy microparticles (hard
spheres), these polymer-based nanoparticles are intrinsically self-
assembled, soft and ‘dynamic’, offering the attractive feature of assem- 
bly or disassembly on demand—from molecules to CBBs and beyond. 
We recently approached the design of such a system by fabricating 
near-monodisperse, monovalent and divalent CBBs (‘monomeric units’) 
by the self-assembly of ABC triblock terpolymers in selective solvents.
Shaping building-block geometries and understanding both interpar- 
ticle interactions and aggregation behaviour led us to the hypothesis 
that the concept of soft nanoparticle self-assembly could be extended to 
the rational design of supracolloidal co-assemblies by suitable combi- 
nations of building blocks. Mixed particle co-assemblies across mul- 
tiple hierarchical levels open a new level of complexity and have yet to 
be addressed. 

Figure 1 | Preparation and configuration of soft colloidal building blocks
(CBBs). a, Dispersion of ABC triblock terpolymers in a non-solvent for B
yields B-core particles with A/C corona patches. During transfer into a
non-solvent for A and B, these develop into monovalent AB^C and divalent
AB^{C}A units with sticky A patches. b, CBBs feature associative (‘sticky’)
polystyrene (S) patches reversibly made solvophobic on demand, chemically
different (inert) polybutadiene (B) or poly(3-butenyl(dodecyl)sulfane) (D) core
material physically holding the CBBs together, and poly(methyl methacrylate)
(M), poly(tert-butyl methacrylate) (T) or poly(2-vinylpyridine) (V) as
solubilizing/repulsive corona varying in polarity and functionality.

Figure 2 | Guided co-assembly across multiple hierarchical levels. Level 0:
triblock terpolymers are the basic building blocks. Level 1: self-assembly of
monovalent s-SB^M and divalent SD^{M}S CBBs in dependence on the volume
ratios of the core-forming blocks (intermediate CBBs were captured by
crosslinking; Extended Data Fig. 2). Level 2 self-assembly: s-SB^M forms
spherical (SB^M)_x clusters and SD^{M}S linear [SD^{M}S]_m supracolloidal polymer
chains. Level 2 co-assembly: s-SB^M and SD^{M}S with mutually attracting S
patches co-assemble into mixed superstructures stabilized by the common M
corona. (OsO4 staining: -S- segments grey, B cores dark grey, -D- segments
black, and M not visible as a result of degradation by the electron beam.)

The general design criteria, developed in an experimental approach,
for the guided co-assembly of multiple CBBs with distinct valences are
few and simple, and should be widely applicable to polymer-decorated
particles (Fig. 1a). One essential requirement is tunable and defined
attractive interaction patterns responsive to solvent quality or other
external stimuli to favour near-monodisperse structures on different
length scales. Modern polymer synthesis provides us with well-defined
block copolymers with a wide range of properties and responses. We
developed a set of ABC triblock terpolymers into which we pre-encoded
all parameters necessary for sequential, hierarchical assembly (Fig. 1b).
The volume ratio of the core-forming segments, V_A/V_B, determines
CBB valence, and the total molecular mass, M_n, controls particle size.
Thus, V_A/V_B > 1 yields monovalent AB^C Janus CBBs with one attractive
A patch and one repulsive C patch on opposing sides of the B core
(we use bold lettering to distinguish CBBs from the underlying polymer
chains with regular lettering); V_A/V_B < 1 leads to divalent AB^{C}A
with two attractive A patches on opposing sides of the B core and a
repulsive C patch emanating radially from the B core. The superscript
indicates that the C corona is attached to the B core (Extended Data
Table 1). From here onwards we replace ABC by the actual block
sequences: SBM (polystyrene-b-polybutadiene-b-poly(methyl methacrylate))
and SDM (polystyrene-b-poly(3-butenyl(dodecyl)sulfane)-b-poly(methyl
methacrylate)). Monovalent SB^M and divalent SD^{M}S CBBs
were prepared separately by self-assembly in N,N-dimethylacetamide
(DMAc), a non-solvent for the middle blocks, B and D (Extended Data
Fig. 1). At this stage the S and M patches are still soluble and not yet
completely phase-separated, and they reorganize dynamically into
fully developed CBBs when the S block is made insoluble. We trigger
this step by changing the solvent to acetone/propan-2-ol (60:40 v/v),
leading to particles with essentially different patch arrangements. SB^M
and SD^{M}S thereby act as monomeric units for self-assembly and
co-assembly on the next level (transmission electron microscopy (TEM)
images in Fig. 2, level 1, and Extended Data Fig. 2). If kept separate, both
units undergo self-assembly into spherical and linear superstructures
to minimize energetically unfavourable S patch/non-solvent interfaces
(Fig. 2, level 2 self-assembly), once the solvent quality has been reduced
to a critical threshold for the S patches. Small corona volumes, V_M,
provide less steric repulsion and promote higher aggregation numbers
of SB^M units per spherical (SB^M)_x cluster or, similarly, higher degrees
of polymerization of SD^{M}S units per [SD^{M}S]_m supracolloidal polymer
chain (dimer → oligomer → polymer).
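
The valence rule stated above (V_A/V_B > 1 gives monovalent units, V_A/V_B < 1 gives divalent units) can be written as a small decision function. This is only a sketch of the stated criterion; the function name and the example volumes are hypothetical, and the text gives no rule for a ratio of exactly 1, so that case is reported as undetermined.

```python
def cbb_valence(v_a, v_b):
    """Classify a colloidal building block from the volumes of its A and B blocks.

    v_a, v_b: volumes of the A and B blocks in the same (arbitrary) units.
    Returns 'monovalent' for V_A/V_B > 1 and 'divalent' for V_A/V_B < 1.
    """
    if v_a <= 0 or v_b <= 0:
        raise ValueError("block volumes must be positive")
    ratio = v_a / v_b
    if ratio > 1:
        return "monovalent"   # AB^C Janus unit with one attractive A patch
    if ratio < 1:
        return "divalent"     # AB^{C}A unit with two attractive A patches
    return "undetermined"     # the stated rule does not cover a ratio of 1


# Hypothetical block volumes (arbitrary units), for illustration only.
print(cbb_valence(3.0, 1.0))
print(cbb_valence(1.0, 2.5))
```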

Co-assembly requires at least two CBBs differing in size (10–100 nm),
patchiness or chemistry (core/corona). To demonstrate our concept,
we mixed two CBBs differing in all three aspects in DMAc in defined
particle ratios (s-SB^M:SD^{M}S = 8:1; Fig. 1 and Extended Data Fig. 2)
(the prefix ‘s’ in s-SB^M stands for small). We chose small monovalent
s-SB^M (hydrodynamic radius R_h ≈ 10 nm) in combination with much
larger divalent SD^{M}S (R_h ≈ 50 nm). In DMAc, the core-forming segments
(B and D) are immiscible; the two CBB species therefore do not
exchange terpolymer chains and so evolve independently. As observed
for individual species, changes in solvent quality for the S patch destabilize
the SD^{M}S units and induce aggregation into supracolloidal polymer
chains with an [SD^{M}S]_m sequence. On a similar timescale, the
s-SB^M units start to aggregate, yet instead of forming spherical clusters
(self-assemblies), they selectively attach to the newly formed free surface of
-S- segments within [SD^{M}S]_m, thereby decreasing the -S-/non-solvent
interface (Fig. 2, level 2 co-assembly). In the presence of both particles
we observe exclusively co-assembly under these conditions. This is
surprising, because aggregation of both CBBs is driven solely by a weak
non-directional force (solvophobicity) and each CBB is able to form
stable populations of spherical and linear superstructures by itself.
Yet the development of [SD^{M}S]_m supracolloidal polymer chains
favours the attachment of s-SB^M units to the -S- segments. Therefore,
we associate this phenomenon with a certain level of cooperativity. We
chose s-SB^M units with a particle diameter (d_CBB ≈ 19 nm) matching
the width of the -S- segments of SD^{M}S (w ≈ 24 nm) (Extended Data
Fig. 2) and indeed found a defined number of seven to nine s-SB^M units
radially covering the -S- segments, which perfectly reflects the original
mixing ratio. On exceeding the loading capacity of the -S- segments
(for example s-SB^M:SD^{M}S = 35:1), single s-SB^M CBBs or raspberry-like
(s-SB^M)_x self-assemblies locate in the vicinity of the fully decorated
co-assemblies (Extended Data Fig. 3).
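
The bookkeeping behind the loading capacity can be sketched as follows. The sketch assumes, as a simplification, that each divalent unit contributes one -S- segment to the chain and that every segment holds a fixed number of monovalent units (eight, the average reported above); the function name and this fixed-capacity model are illustrative assumptions rather than part of the experimental analysis.

```python
def decoration(n_monovalent, n_divalent, capacity=8):
    """Split monovalent units into attached and excess populations.

    n_monovalent: number of monovalent units in the mixture
    n_divalent:   number of divalent units (assumed one -S- segment each)
    capacity:     assumed number of monovalent units one -S- segment can hold
    Returns (attached, excess).
    """
    if min(n_monovalent, n_divalent, capacity) < 0:
        raise ValueError("counts must be non-negative")
    attached = min(n_monovalent, n_divalent * capacity)
    return attached, n_monovalent - attached


# Mixing ratios quoted in the text, per divalent unit.
print(decoration(8, 1))    # 8:1, all monovalent units fit on the segment
print(decoration(35, 1))   # 35:1, the surplus stays free or forms clusters
```

Under these assumptions, a ratio of 8:1 leaves no surplus, whereas a ratio of 35:1 leaves most monovalent units unattached, in line with the free units and raspberry-like clusters described above.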

Besides suitable particle ratios, the timescales of aggregation during
solvent exchange have to correlate to facilitate proper co-assembly.
Because the corona volume of the s-SB^M units affects the critical solvent
composition needed for aggregation, we used terpolymers with different
lengths of the M block (y stands for s (small), m (medium) or l
(large) in Extended Data Table 1 and Extended Data Figs 4 and 5).
Divalent SD^{M}S CBBs self-assemble under solvent conditions in which
most s-SB^M prevail as ‘monomeric’ units. Stability against aggregation
is best for s-SB^{M(l)} with the largest M corona. Whereas s-SB^{M(m)}
leads to simultaneous co-assembly, s-SB^{M(l)} shows a slight delay. In
both cases, however, about eight units decorate the -S- segments, pointing to a
robust process with sufficient dynamics for rearrangements. By contrast,
particles with too short a corona, for example s-SB^{M(s)}, are unstable
before SD^{M}S polymerization, and lateral decoration is absent. Instead,
fully developed spherical (s-SB^{M(s)})_x clusters are incorporated into the
linear [SD^{M}S]_m superstructure during the step-growth polymerization
of SD^{M}S (Extended Data Fig. 4).

Figure 3 | Size-selective attachment and control of supracolloidal polymer
chain length. a, The -S- segments accommodate seven to nine s-SB^M units.
b, Larger m-SB^M units fit only once and induce strong kinks. c, l-SB^M units are
too large for lateral decoration and instead act as selective end-cappers.
d, e, Length control of supracolloidal polymer chains by the mixing of 100
(d) and 10 (e) SD^{M}S units per two l-SB^M end-cappers (Extended Data Fig. 7).
Inset shows [SD^{M}S]_m nanostructure. f, Polydispersity indices (PDI; top) and
linear dependence of SD^{M}S repeating units on the mixing ratio (bottom).

The overall particle size of the x-SB^M CBBs (x stands for s (small),
m (medium) or l (large)) markedly affects the number of particles
attached to each -S- segment of the [SD^{M}S]_m supracolloidal polymer
chains (Fig. 3 and Extended Data Fig. 6). The particle diameter of
s-SB^M units, d_CBB ≈ 19 nm, allows the incorporation of eight particles
on average, also slightly enlarging the width of the -S- segments from
w ≈ 24 nm to w ≈ 30 nm (Figs 2 and 3a). Larger CBBs such as m-SB^M
(d_CBB ≈ 35 nm) require more space; only one particle is therefore able
to attach to each -S- segment (Fig. 3b). As a result of their size and the
accompanying volume displacement inside the -S- segments, m-SB^M
units induce strong kinks altering the flexibility and linearity of the
co-assembly, which could potentially influence the rheological characteristics
and colloidal chain packing. l-SB^M units, which substantially
exceed the width of the -S- segments (d_CBB ≈ 47 nm), are too large
for lateral decoration and specifically locate on terminal positions
(Fig. 3c). The relative sizes of the monovalent units and the -S- segment
of the divalent units govern the location and loading capacity, whereas
the size of the stabilizing M patch determines the timescales and extent
of co-assembly.

Selective end-capping provides an attractive handle to control the
length of the [SD^{M}S]_m supracolloidal polymer chains. Adding l-SB^M
‘stoppers’ in specific ratios also allows end-functionalization with chemically
different CBBs. Without any end-capper, the SD^{M}S units grow
into extended structures in the region of 30 µm long (more than 600
repeating units; Extended Data Fig. 7), reaching scales visible by optical
microscopy (Extended Data Fig. 6). Figure 3d–f summarizes the dependence
of the length distribution of the co-assemblies on the mixing
ratio SD^{M}S:l-SB^M (Extended Data Fig. 7). The decrease in mean length
is evident when comparing the TEM images in Fig. 3d, e, which show
mixing ratios of 100:2 and 10:2. Stoichiometric control as known from
step-growth polymerizations is corroborated by the linear dependence
of the average degree of polymerization on the mixing ratio (Fig. 3f).
The deviation from the theoretical values (dashed line) is caused by
residual ‘defects’ in the CBBs, because we find a small proportion (less
than 1%) of trivalent S_3D^M CBBs that induce branching. Matching
patch sizes of CBBs and low corona volumes of the SB^M units are
decisive in efficient end-capping: the first effectively prevents the addition
of SD^{M}S monomeric units and the latter causes the SB^M units to
interfere directly with SD^{M}S polymerization. Both factors can be programmed
molecularly into the SB^M units by variation of the total molecular
mass (particle size) and the fraction of M (onset of aggregation).
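
The stoichiometric length control described above can be made concrete with a short sketch. It assumes an idealized case in which every chain is linear and carries exactly two end-cappers, so that the expected number of repeating units per chain is twice the ratio of divalent units to end-cappers. It also assumes the usual definition of the polydispersity index as the ratio of weight-average to number-average chain length. The function names and the example chain lengths are hypothetical.

```python
def ideal_degree_of_polymerization(n_divalent, n_cappers):
    """Expected repeating units per chain if each chain carries two end-cappers."""
    if n_divalent < 0 or n_cappers <= 0:
        raise ValueError("need n_divalent >= 0 and n_cappers > 0")
    return 2.0 * n_divalent / n_cappers


def chain_statistics(lengths):
    """Return (number average, weight average, PDI) for counted chain lengths."""
    if not lengths or any(length <= 0 for length in lengths):
        raise ValueError("need a non-empty list of positive chain lengths")
    total = sum(lengths)
    dp_n = total / len(lengths)                        # number average
    dp_w = sum(length**2 for length in lengths) / total  # weight average
    return dp_n, dp_w, dp_w / dp_n


# Mixing ratios quoted in the text (divalent units : end-cappers).
print(ideal_degree_of_polymerization(100, 2))
print(ideal_degree_of_polymerization(10, 2))

# Hypothetical counted chain lengths, for illustration only.
print(chain_statistics([5, 15]))
```

Comparing the number average obtained from counted chains with the ideal value shows how far a sample deviates from perfect stoichiometric control, for example because of branching defects.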

Controlled mixing of different CBBs permits the rational design and
precise implementation of functionalities within the core and/or corona
of the co-assemblies (Fig. 4). We prepare hybrid materials by
selectively encapsulating 10-nm maghemite nanoparticles within the
-D- segments of SD^{M}S CBBs on hierarchy level 1 (Fig. 4a). Selective
and reversible loading yields a unique advantage compared with top-down
approaches that require the synthesis of tailored colloids. The
preloaded CBBs then polymerize further to linear chains with alternating
D-loaded/S-empty compartments. Nanoparticles with tailored
affinities for other phases may allow the preparation of bar-coded
distributions, and magnetically responsive co-assemblies may serve
as advanced viscosity modifiers or may act as deterministic carrier
systems. We also generated co-assemblies comprising five different
environments (three core/two corona compartments; Fig. 4b) by mixing
SD^{M}S with SB^T (T = poly(tert-butyl methacrylate)), which leads to
the already established [SD^{M}S]_m core sequence, yet now with a segmented
M/T corona. In principle, such structures allow us to control
the positioning of nanoparticles along the co-assemblies or to implement
predefined responsive folding sites. At this point we emphasize
enhanced superstructure stability, because the pristine [SD^{M}S]_m immediately
precipitates in ethanol (non-solvent for M), whereas the
SB^T/[SD^{M}S]_m co-assemblies remain stable (Extended Data Fig. 8).

We extended this approach and designed multiblock co-assemblies
by combining two divalent units, SD^{M}S and SB^{M}S (Fig. 4c). Here, several
short segments of [SD^{M}S]_m or [SB^{M}S]_n sequences are coupled into
multiblock co-assemblies. We suggest that the homogeneity and sequence
distribution depend on how synchronized self-assembly and co-assembly
occur and that deeper kinetic studies will eventually enable control over
the sequence length. In an analogy with polymer architectures, we fabricated
telechelic oligomers with terminal functional groups by end-capping
[SD^{M}S]_10 oligomers with SB^V (V = poly(2-vinylpyridine);
Fig. 4d). The number-average length of the telechelics scales with the
mixing ratio SD^{M}S:SB^V. The V corona is clearly visible as a grey end-cap
by TEM (Fig. 4d, central inset). Protonation of the V corona with HCl
causes solvophobic attraction and triggers aggregation of the V-termini
into extended subdivided superstructures. We emphasize that this
process is fully reversible and that it corresponds to a step-growth
polymerization on two levels: first for the individual SD^{M}S units,
and then chain extension of the telomerized ‘macromonomers’. Combining
selective end-capping and lateral decoration yields ternary
structures. The addition of SB^M units to the SB^V–[SD^{M}S]_10–SB^V telechelic
oligomers does indeed lead to the decoration of the -S- segments
and a final composition of SB^M:SD^{M}S:SB^V = 10:10:2.

Finally, we bridge three hierarchical levels by co-assembly of SB^M
and SD^{T}S building blocks into end-capped colloidal molecules with
the composition (SB^M)_N–[SD^{T}S]_1–(SB^M)_N (where the number of
end-caps N = 1–5). The CBB mixing ratio determines the number of
end-cappers; for example, SB^M:SD^{T}S = 2:1 gives (SB^M)_1–[SD^{T}S]_1–(SB^M)_1.
These co-assemblies self-assemble spontaneously into triangles and
further into two-dimensional networks when cast onto substrates, a
necessary two-dimensional confinement tool, and the number of
end-caps (SB^M) directs the number of nearest neighbours at the network
linkages (Fig. 4e). The self-assembly of these colloidal molecules
is directed by the M/T corona patches that change their volume on
solvent evaporation and develop into attractive M and repelling T
patches. For N = 1 or 2 the end-cap size allows three or four nearest
neighbours, {SB^M–SD^{T}S–SB^M}_{3,4}, whereas for N = 3–5 there is only
space for two nearest neighbours and the networks show the onset of
distorted kagome lattice formation (Extended Data Fig. 9).

Figure 4 | Binary and ternary co-assemblies. a, Hybrid co-assemblies with
magnetite nanoparticles selectively loaded into one core segment. b, Core and
corona co-assemblies. The [SD^{M}S]_m superstructure is decorated with
SB^T CBBs, yielding an alternating M/T corona. c, Linear multiblock
co-assembly composed of [SD^{M}S]_m/[SB^{M}S]_n sequences (M corona omitted for
clarity). d, SB^V end-cappers produce telechelic [SD^{M}S]_10 oligomers (grey V
corona surrounding the SB^V end-cap in inset). Protonation with HCl triggers
further polymerization, and decoration with SB^M units yields ternary
co-assemblies. e, Multicomponent structuring across three hierarchical levels. In
solution, the terpolymers self-assemble into CBBs that co-assemble into
colloidal molecules; these finally self-assemble into networks after drop-casting
on a substrate.

These hierarchical multicomponent superstructures demonstrate
the level of complexity reached through the controlled co-assembly
of soft patchy nanoparticles, which is in stark contrast to the self-assembly
of increasingly complex building blocks (shape and surface pattern).
Using proper interacting segments, a wide range of building block
combinations is conceivable, from biological origin to inorganic and
organic nanoparticles and microparticles. Various self-assembly stimuli
can conveniently be implemented by means of functional polymer
blocks, for example supramolecular interactions, chemical reactions
and environmental triggers (solvent polarity, pH, temperature, light or
electrochemistry). Co-assembly opens up avenues to construct new
materials, also far from thermodynamic equilibrium, through the tailored
spatial organization of functionalities and the control of kinetics.
We foresee application possibilities in smart materials, sensing, photonics
and nanolithography.


METHODS SUMMARY


Relevant parameters of SBM, SBV and SBT triblock terpolymers are summarized
in Fig. 1 and Extended Data Table 1. SBM and SBT were converted to SDM and
SDT by thiol-ene click reaction of 1-dodecane thiol to the poly(1,2-butadiene)
block. All CBBs were prepared in DMAc at a polymer concentration of 0.1 g l⁻¹
and annealed overnight at 70 °C. Particle dispersions were mixed in specific
particle ratios and 10 ml of solution was co-dialysed (molecular mass cut-off
12,000–14,000 g mol⁻¹; Roth) against 5 l of selective solvent or solvent mixture for the
corona block (acetone/propan-2-ol for M and V; ethanol for T corona). The
solvent exchange was monitored by ¹H-NMR. TEM was performed in bright-field
mode on Zeiss CEM 902 and 922 OMEGA electron microscopes operated at 80 kV
and 200 kV, respectively. Co-assemblies were deposited by drop-casting (0.05 ml
of 0.1 mg ml⁻¹ dispersion) onto carbon-coated copper grids resting on a filter
paper to blot excess solution immediately. Samples were exposed to OsO4 vapour
for 2 h to stain the polybutadiene blocks selectively.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Particle preparation. All terpolymers were synthesized by sequential anionic
polymerization. Important parameters are summarized in Fig. 1 and Extended
Data Table 1. All solvents used were of analytical grade. Dialysis tubes of
regenerated cellulose with a molecular mass cut-off of 12,000–14,000 g mol⁻¹ were
purchased from Roth. Oleic acid-stabilized maghemite nanoparticles were
synthesized as described elsewhere. Both polymers, SBM and SBT, were converted to
SDM and SDT, respectively, by means of a thiol-ene click reaction of 1-dodecane
thiol with the pendant double bonds of poly(1,2-butadiene). In a typical
experiment, terpolymer (0.5 g) was dissolved in 20 ml of tetrahydrofuran; 20 ml of
1-dodecane thiol were added and the solution was purged for 15 min with argon.
The mixture was irradiated for 24 h with an ultraviolet lamp with a cut-off filter
(λ = 300 nm; λmax = 360 nm). After the reaction, the modified terpolymer was
precipitated in propan-2-ol, filtered, and washed with excess propan-2-ol to remove
remaining 1-dodecane thiol. SBM, SDM, SBT, SDT and SBV triblock terpolymers
were dissolved separately in DMAc at an initial polymer concentration of 0.1 g l⁻¹
and annealed overnight at 70 °C to guarantee an equilibrated system. At this stage
of self-assembly the corona patches were not yet fully developed. To reveal the
patches in the developed state (Extended Data Fig. 2), the CBBs were crosslinked
while located within the respective superstructure. For this purpose, 2 equivalents of the
photo-crosslinker 2,4,6-trimethylbenzoyldiphenylphosphineoxide (Lucirin TPO;
λmax ≈ 360 nm) were added per double bond. Gentle stirring for 2 h ensured
a homogeneous distribution of the photo-crosslinker before the samples were
irradiated for 1 h with an ultraviolet lamp with a cut-off filter (λ = 300 nm).
Re-dispersion in DMAc as a good solvent for the patches then broke up the
superstructures into the respective CBBs.

Self-assembly and co-assembly. CBB solutions (0.1 g l⁻¹ in DMAc) were mixed
in specific particle ratios to yield 10 ml of a colloidal mixture and co-dialysed against
5 l of selective solvent/solvent mixture for the corona block (acetone/propan-2-ol
for the M corona and V corona block, and ethanol for the T corona block). The
solvent exchange was monitored by ¹H-NMR. TEM was performed in bright-field
mode on Zeiss CEM 902 and LEO 922 OMEGA electron microscopes operated at
80 kV and 200 kV, respectively. Samples were prepared by dropping 0.05 ml of a
0.1 mg ml⁻¹ colloidal suspension onto carbon-coated copper grids resting on a
filter paper to remove the excess solution immediately. The two-dimensional
networks were prepared similarly, except that a 0.01 g l⁻¹ colloidal suspension was
allowed to settle on the TEM grid for 30 s before blotting. TEM grids were then
exposed for 2 h to OsO₄ vapour to stain the polybutadiene block selectively. The
number of polymer chains in each CBB was determined by evaluating at least 250
cores. The average degree of polymerization of each [SDᴹS]ₘ colloidal polymer
was determined by counting the black segments of 500 colloidal polymers and
plotted as a frequency distribution.

Correlating co-assembly with onsets of aggregation. We used a CBB ratio of
s-SBᴹ⁽ʸ⁾:SDᴹS = 8:1 for all experiments to ensure that enough monovalent CBBs
were present for full decoration of the -S- segments. SDᴹS was co-dialysed with
three different s-SBᴹ⁽ʸ⁾ units with varying volume of the corona, M(y) (y stands for
s (small), m (medium) or l (large)). The dialysis sequence was from DMAc into
acetone/propan-2-ol mixtures. Greater amounts of propan-2-ol led to a contraction
of the M corona. SDᴹS forms extended superstructures irrespective of the
propan-2-ol content (also in pure acetone). s-SBᴹ⁽ˡ⁾ CBBs with a large relative
volume fraction of the soluble M corona of r_M = 0.74 (where r_M = V_M/(V_S + V_B))
do not co-assemble at low propan-2-ol contents (less than 20 vol%) and are only
located in the vicinity of polymerized SDᴹS (Extended Data Fig. 4). The long
corona blocks provide steric stabilization by fully covering the CBB. At high
propan-2-ol contents (30 vol%), partial co-assembly is observed, and quantitative
co-assembly occurs only at 40 vol% as a result of progressive contraction of the M corona.
Because the SDᴹS units are already fully polymerized, decoration does not interfere
with the growth process (no premature end-capping). s-SBᴹ⁽ᵐ⁾ CBBs with
r_M = 0.34 carry a corona block of moderate length and show partial to quantitative
co-assembly over the whole range of solvent mixtures, indicating only a slight
difference in onset of aggregation between SDᴹS and s-SBᴹ⁽ᵐ⁾. s-SBᴹ⁽ˢ⁾ CBBs
with the shortest corona block, r_M = 0.15, show no co-assembly in any of the
solvent compositions: at a propan-2-ol content of only 10 vol% the corona provides
insufficient stabilization for the s-SBᴹ⁽ˢ⁾ CBBs, and aggregation into spherical
(s-SBᴹ⁽ˢ⁾)ₓ clusters occurs before polymerization of SDᴹS. The SDᴹS units still
polymerize through these spherical clusters, resulting in randomly distributed
bulbs of (s-SBᴹ⁽ˢ⁾)ₓ clusters along the [SDᴹS]ₘ chain.

Calculation of polymer chains in each particle and compartment volumes.
Because CBBs form dynamically during the self-assembly of triblock terpolymers,
one needs to know how many polymer chains are located within one CBB to be
able to mix CBBs in the desired ratios for the co-assembly process. We determined
the average core diameter of each CBB by TEM (averaged over 250 samples) to
calculate the aggregation number of triblock terpolymer chains, N_agg, in each CBB.
This evaluation is exemplified on monovalent spherical s-SBᴹ and divalent,
cylindrical SDᴹS cores (Extended Data Fig. 2). N_agg was then calculated from

$$N_{\mathrm{agg}} = \frac{m_{\mathrm{core}}}{m_{B}^{\mathrm{chain}}} = \frac{N_{A}\,\rho_{B}\,V_{\mathrm{core}}}{M_{B}^{\mathrm{chain}}}$$

with V_core = (4/3)πR_core³ for spherical SBᴹ and V_core = wπR_core² for cylindrical
SDᴹS; m_core is the mass of the micellar core, m_B^chain is the mass of the middle
block, M_B^chain is the molecular mass of the middle block, N_A is Avogadro's constant
and ρ_B is the density of the middle block. We found that this approach yields a
good correlation between the ratio of triblock terpolymers and the final composition
of CBBs in the co-assemblies. We can calculate the diameter of SBᴹ units by
combining the calculated volume for the B compartments, V_B, from N_agg with the
volume ratio V_S/V_B. The diameter of spherical CBBs, d_CBB, is then given by

$$d_{\mathrm{CBB}} = 2\left(\frac{3\,(V_{S} + 0.5\,V_{B})}{4\pi}\right)^{1/3}$$

The volume and surface area of the cylindrical -S- segments are simply determined
from TEM data; that is, directly from the colloidal co-assembly as width (w)
and height = diameter (d).
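The two relations above can be sketched numerically as follows. This is a minimal illustration, not the authors' analysis code: the function names are hypothetical, and the polybutadiene density (0.9 g cm⁻³) and middle-block molar mass (150 units of roughly 54 g mol⁻¹) used in the example call are assumptions chosen only to show the unit handling.

```python
import math

AVOGADRO = 6.02214076e23  # mol^-1
NM3_PER_CM3 = 1e21        # 1 cm^3 = 1e21 nm^3


def sphere_volume(radius_nm):
    """Core volume of a spherical CBB, V_core = 4/3 * pi * R^3 (nm^3)."""
    return 4.0 / 3.0 * math.pi * radius_nm ** 3


def cylinder_volume(radius_nm, width_nm):
    """Core volume of a cylindrical CBB, V_core = w * pi * R^2 (nm^3)."""
    return width_nm * math.pi * radius_nm ** 2


def aggregation_number(v_core_nm3, density_g_cm3, block_mass_g_mol):
    """N_agg = N_A * rho_B * V_core / M_B^chain."""
    core_mass_g = density_g_cm3 * v_core_nm3 / NM3_PER_CM3
    return AVOGADRO * core_mass_g / block_mass_g_mol


def cbb_diameter(v_s_nm3, v_b_nm3):
    """Diameter of a sphere of volume V_CBB = V_S + 0.5 * V_B (nm)."""
    v_cbb = v_s_nm3 + 0.5 * v_b_nm3
    return 2.0 * (3.0 * v_cbb / (4.0 * math.pi)) ** (1.0 / 3.0)


if __name__ == "__main__":
    # Assumed inputs: 5 nm core radius, 0.9 g cm^-3, 8,100 g mol^-1 middle block.
    n_agg = aggregation_number(sphere_volume(5.0), 0.9, 8100.0)
    print(f"N_agg ~ {n_agg:.0f}")
```

Because the density and block mass are assumed here, the printed value should be read as an order-of-magnitude check rather than a reproduction of Extended Data Table 1.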
Extended Data Figure 1 | Experimental approach to prepare colloidal co-assemblies
exemplified on s-SBᴹ and SDᴹS. a, First, the triblock terpolymers
are dispersed separately in DMAc as a non-solvent for B and D, equalling
bottom-up structuring of CBBs with different cores (B and D), yet identical
'sticky' S patch and stabilizing M corona. b, Both colloidal dispersions are
mixed in specific ratios and stirred overnight to ensure homogeneous
distribution. c, Co-dialysis into a non-solvent for the S patches induces collapse
and aggregation. SDᴹS grows into extended linear colloidal polymers
decorated by s-SBᴹ units (M corona omitted for clarity). d, Final colloidal
co-assemblies stabilized by the common M corona.
Extended Data Figure 2 | Compartment sizes, polymer chains per CBB and
CBB mixing ratios. a, SBM with V_S/V_B = 1.70 and SDM with V_S/V_D = 0.68
result in b, monovalent SBᴹ (Janus) and divalent SDᴹS units, respectively.
CBBs are visualized after crosslinking of the remaining double bonds within the
particle cores. c, Examples of spherical SBᴹ clusters and linear SDᴹS colloidal
polymers. d, Diagrams of CBB compartment volumes and surface area of
associative patches. We determine the number of SBᴹ units able to attach to the
-S- segments of the SDᴹS colloidal polymers by calculating the volume and the
diameter of the CBBs, assuming a spherical shape of B and S phases and
considering the number of polymer chains per patch. The spherical SBᴹ units
are composed of (i) the body as the sum of the collapsed S patch (grey) and B
core (black) and (ii) the M corona patch. The radius, R_CBB, and the volume,
V_CBB, displaced by the CBB when aggregating into the -S- segments are
estimated by combining the volume of S plus half of the volume of B,
V_CBB = V_S + 0.5 V_B (the dashed line marks R_CBB). These assumptions are valid
for the collapsed state, given the unfavourable interactions with the surrounding
medium and the minimization of the interfacial energies.
Extended Data Figure 3 | Loading capacity for decoration with spherical
CBBs during co-assembly. SDᴹS and s-SBᴹ colloidal co-assemblies with
increasing mixing ratios 2:1, 4:1, 8:1 and 35:1 of s-SBᴹ:SDᴹS and
corresponding co-assembly composition. a, At small excess,
s-SBᴹ:SDᴹS = 2:1, the -S- segments are only partly decorated. b, c, With
increasing CBB ratio, the -S- segments of the colloidal polymers are more
strongly occupied (4:1) (b) and at full capacity in a radial manner (8:1)
(c). d, Above this ratio, no space remains on the colloidal polymer and only then
does s-SBᴹ start to form homo-clusters (35:1; football-like or raspberry-like
homo-clusters adjacent to the colloidal co-assemblies). These observations fit
well with our calculations, as a cylinder with d ≈ 55 nm and w ≈ 24 nm can
accommodate up to nine s-SBᴹ units with d_CBB ≈ 19 nm (πd ≈ 173 nm;
173 nm/19 nm ≈ 9.1). Scale bars in insets are 200 nm. (OsO₄ staining: S grey,
B dark grey dots, D black, and M not visible as a result of degradation by the
electron beam.)
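The loading-capacity estimate in this caption is a simple circumference argument, which can be sketched as below. The function name is hypothetical; the inputs are the segment diameter (55 nm) and CBB diameter (19 nm) quoted in the caption.

```python
import math


def max_units_around_segment(segment_diameter_nm, unit_diameter_nm):
    """Number of spherical units that fit side by side around a cylinder's circumference."""
    circumference_nm = math.pi * segment_diameter_nm
    return circumference_nm / unit_diameter_nm


ratio = max_units_around_segment(55.0, 19.0)
# Only whole units can attach, so round down to get the capacity per segment.
print(f"circumference = {math.pi * 55.0:.0f} nm, ratio = {ratio:.1f}, capacity = {math.floor(ratio)}")
```

This treats the units as touching circles of diameter d_CBB placed on the segment surface, which is the same simplification the caption makes.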
Extended Data Figure 4 | Co-assembly of s-SBᴹ⁽ʸ⁾ and SDᴹS in dependence
on the corona volume of the monovalent s-SBᴹ⁽ʸ⁾ units (y stands for s, m or l).
Colloidal co-assembly occurs preferentially with matching onsets of aggregation
that depend on the corona volume, which is tunable either by decreasing the
block length of the corona block, r_M = V_M/(V_S + V_B), r_M(s-SBᴹ⁽ˡ⁾) = 0.74,
r_M(s-SBᴹ⁽ᵐ⁾) = 0.34, r_M(s-SBᴹ⁽ˢ⁾) = 0.15, or by reducing the solvent quality,
here the addition of propan-2-ol from 10 vol% to 40 vol%. Crosses
indicate no co-assembly, circles the onset of co-assembly and tick marks
effective and quantitative co-assembly. For detailed discussion see Methods.
(OsO₄ staining: S grey, B dark grey dots, D black, and M not visible as a result of
degradation by the electron beam.)
Extended Data Figure 5 | Timeline of CBB aggregation and response of
CBB constituents on solvent composition. a, Co-dialysis of CBBs (here
SDᴹS and s-SBᴹ) from DMAc (squares) into acetone (circles)/propan-2-ol
(triangles) mixtures (60:40 v/v) results in a continuous change of a multitude of
polymer-polymer interactions as well as polymer block responses to the
ternary solvent mixture. With progressing dialysis, DMAc is replaced by
acetone/propan-2-ol, affecting the solubility of S/M corona patches. This
induces the collapse into S patches and determines the onset of aggregation.
Time-dependent ¹H-NMR measurements were performed by drawing samples
at specific intervals during dialysis to determine the solvent composition.
b, TEM series exemplifying the timeline of aggregation on s-SBᴹ and
SDᴹS. Up to 30 min, the S corona collapses to form S patches, yet both CBBs are
still stable owing to the stabilizing corona; after 30 min, SDᴹS (smaller M corona)
aggregates; after 45 min, aggregation of SDᴹS proceeds, while s-SBᴹ is still
stable (larger M corona, no co-assembly); at 60 min, s-SBᴹ aggregates and
co-assembly takes place; by 90 min, co-assembly is complete. (OsO₄ staining:
S grey, B dark grey dots, D black, and M not visible as a result of degradation by
the electron beam.)
Extended Data Figure 6 | Supporting images of colloidal co-assemblies
composed of SDᴹS and s-SBᴹ, m-SBᴹ and l-SBᴹ. a, Optical microscopy
image of colloidal co-assemblies 30 µm in length. b, c, TEM images of fully
decorated -S- segments with s-SBᴹ. d, e, Larger m-SBᴹ units induce kinks due
to lateral, alternating decoration and high volume displacement within the
-S- segments. f, Zoom-out of rod-like colloidal oligomers end-capped with
l-SBᴹ. g, Magnification of f. h, The smallest possible co-assembly with
[SDᴹS]₁; that is, two end-caps attached to one SDᴹS unit. (OsO₄ staining:
S grey, B dark grey dots, D black, and M not visible as a result of degradation
by the electron beam.)
Extended Data Figure 7 | Length control of colloidal polymers via end-capping
with l-SBᴹ. a, Without any l-SBᴹ end-capper added, the SDᴹS units
grow into remarkably extended superstructures several micrometres in length,
exceeding 500-600 SDᴹS repeating units. The inset illustrates the segmented
core and some occurrence of branching. The displayed image consists of an
overlay of nine separate TEM images, because the superstructure was far too
large for magnifications that were still able to resolve the nanostructure. The
dashed box marks the part of the colloidal polymer that is shown in Fig. 3d.
b, TEM images and frequency distributions (red bars) of 500 evaluated colloidal
polymers for varying SDᴹS to l-SBᴹ mixing ratios of 40:2, 20:2, 10:2 and 5:2.
Scale bars, 5 µm. We count segments and multiply the resulting average value
by the average segment length of 55 nm to yield the average length of the
co-assembly. Data are plotted against the normalized frequency and fitted by a
Schulz-Flory frequency distribution (squares), F(L) = exp(−L/L_n)/L_n,
showing a continuous decrease in the average length coinciding with the mixing
ratio. The similar timescales of onset of aggregation combined with matching
sizes of CBBs both promote efficient end-capping.
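The length evaluation described in this caption (count segments, multiply by the 55 nm segment length, compare with a Schulz-Flory distribution) can be sketched as follows. The segment counts are made-up illustrative values, not data from the paper, and the function names are hypothetical.

```python
import math

SEGMENT_LENGTH_NM = 55.0  # average segment length quoted in the caption


def number_average_length(segment_counts, segment_length_nm=SEGMENT_LENGTH_NM):
    """Average colloidal-polymer length from the counted segments per polymer."""
    mean_segments = sum(segment_counts) / len(segment_counts)
    return mean_segments * segment_length_nm


def schulz_flory(length_nm, mean_length_nm):
    """Frequency distribution F(L) = exp(-L / L_n) / L_n."""
    return math.exp(-length_nm / mean_length_nm) / mean_length_nm


# Hypothetical segment counts for a handful of colloidal polymers.
counts = [2, 3, 3, 5, 8, 13]
l_n = number_average_length(counts)
for length in (0.0, l_n, 2.0 * l_n):
    print(f"L = {length:7.1f} nm, F(L) = {schulz_flory(length, l_n):.5f} nm^-1")
```

F(L) is normalized so that it integrates to 1 over all lengths, which is why shorter average lengths give a taller, narrower distribution.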
Extended Data Figure 8 | Stability of [SDᴹS]ₘ colloidal polymer versus
SBᵀ/[SDᴹS]ₘ co-assembly in polar solvents. Photograph of two colloidal
solutions after dialysis into ethanol. Left: instant precipitation of the
[SDᴹS]ₘ colloidal homopolymer as a result of complete collapse of the M
corona. Right: the SBᵀ/[SDᴹS]ₘ colloidal co-assembly is stabilized by the
soluble T corona patches. Precipitation is not observed even after weeks.
Extended Data Figure 9 | Colloidal molecules and two-dimensional
network formation. a, Co-assembly of four SBᴹ with one SDᵀS CBB by means
of collapsing S patches in acetone/propan-2-ol (60:40 v/v) yields colloidal
molecules with one or two end-caps of the (SBᴹ)₂-(SDᵀS)-(SBᴹ)₂ type,
permitting three or four nearest neighbours at every linking point in large-area
networks. b, Co-assembly of six SBᴹ with one SDᵀS CBB by means of
collapsing S patches in acetone/propan-2-ol (60:40 v/v) yields colloidal
molecules of the (SBᴹ)₃-(SDᵀS)-(SBᴹ)₃ type. c, Addition of propan-2-ol until
acetone/propan-2-ol reached 20:80 (v/v) selectively collapses the terminal M
corona, allowing two nearest neighbours at every linking point of the network
in solution. On some occasions the network takes on the form of a distorted
kagome lattice.
Extended Data Table 1 | Characteristics of triblock terpolymers and CBBs

CBB code*   Triblock terpolymer†   V_A/V_B‡   V_C/(V_A+V_B)‡   d_CBB§ [nm]   R_core|| [nm]   V_core¶ [×10³ nm³]   N_agg#   V_S☆ [×10³ nm³]

ABSA CBBs
SDᴹS        Sse0Dse0Ms54019        0.37       0.33             55×24         55×30           71.2                 370      26.3
SDᵀS        SsioDsaoT4607®         0.36       0.44             60×30         57×38           96.1                 385      34.6
SBᴹS        Ss90B2700Mse0265       0.38       0.25             100×31        100×50          394.0                2300     149.7

s-SBᴹ⁽ʸ⁾ CBBs of similar size, yet decreasing corona length (l), (m), (s)
s-SBᴹ⁽ˡ⁾    S₃₁₀B₁₅₀M₃₄₀⁷⁴         3.57       0.74             19            5±2             0.5                  40       1.9
s-SBᴹ⁽ᵐ⁾    S510Bze0M260%          4.20       0.34             28            7±2             1.4                  60       5.9
s-SBᴹ⁽ˢ⁾    S700Bz270M180°°        4.50       0.15             29            7±2             1.4                  57       6.3

x-SBᴹ CBBs of different overall size s-, m-, l-
s-SBᴹ       S₃₁₀B₁₅₀M₃₄₀⁷⁴         3.57       0.74             19            5±2             0.5                  40       1.9
m-SBᴹ       SseoBsaoMsa0%3         1.78       0.88             35            11±2            5.6                  140      9.9
l-SBᴹ       Se10BesoMz90"49        1.72       0.26             47            15±2            14.1                 260      24.3

SB CBBs with different nature of the corona
SBᵀ         Ss510Bs4oT 480155      1.70       0.45             28            9±2             3.1                  60       5.2
SBⱽ         S360BssoVs0'20         1.67       0.95             22            7±2             1.4                  54       2.4

* The syntheses and characterization of SBM, SBV and SBT triblock terpolymers were reported in detail previously***°. Superscripts denote the size of the soluble corona patches in relation to the other CBBs: s, 


small; m, medium; l, large.
† Subscripts denote the degrees of polymerization of the corresponding blocks, and the superscript is the molecular mass in kg mol⁻¹ determined with combined ¹H-NMR and GPC (polydispersity index < 1.15) measurements.
‡ Volume fractions V_A, V_B and V_C were calculated from molar volumes and degrees of polymerization.
§ For calculations see Methods. The diameter (d) and width (w) of cylindrical segments of ABSA units and (ABCA)ₘ segments are average values from TEM image analysis.
|| Average of 250 measured particle core radii in TEM image analysis.
¶ Particle core volumes calculated from measured core radii.
# Average aggregation number of polymer chains per particle.
☆ Calculated by applying the relation V_S/V_B.
Evidence for high salinity of Early Cretaceous sea
water from the Chesapeake Bay crater


Ward E. Sanford, Michael W. Doughten, Tyler B. Coplen, Andrew G. Hunt & Thomas D. Bullen


High-salinity groundwater more than 1,000 metres deep in the
Atlantic coastal plain of the USA has been documented in several
locations, most recently within the 35-million-year-old Chesapeake
Bay impact crater. Suggestions for the origin of increased salinity
in the crater have included evaporite dissolution, osmosis and
evaporation from heating associated with the bolide impact. Here
we present chemical, isotopic and physical evidence that together
indicate that groundwater in the Chesapeake crater is remnant Early
Cretaceous North Atlantic (ECNA) sea water. We find that the sea
water is probably 100-145 million years old and that it has an
average salinity of about 70 per mil, which is twice that of modern
sea water and consistent with the nearly closed ECNA basin. Previously,
temperature and salinity levels of ancient oceans have
been estimated indirectly from geochemical, isotopic and palaeontological
analyses of solid materials in deep sediment cores. In contrast,
our study identifies ancient sea water in situ and provides a
direct estimate of its age and salinity. Moreover, we suggest that
remnants of ECNA sea water probably persist in deep sediments at
many locations along the Atlantic margin.

Hypersaline groundwater in the deep Atlantic Coastal Plain of North
America was discovered in several locations decades ago. Off the coast
of Georgia, hypersaline brines (>100 g kg⁻¹) associated with anhydrite
deposits exist at depths exceeding 1.2 km in Lower Cretaceous strata.
On Cape Hatteras, North Carolina, drilling in 1946 recovered brine
from a depth of 2.0 km with a salinity of 117 g kg⁻¹ (ref. 9). In southeastern
Maryland, actual and estimated chloride concentrations of
40 g kg⁻¹ and 50 g kg⁻¹ were reported at 1.0 km and 1.2 km. In southern
New Jersey, at 1.0 km, a chloride concentration of 25 g kg⁻¹ was observed
(ref. 2). Observations have also indicated that the position of the
freshwater-saltwater transition zone extends tens of kilometres farther
inland near the southernmost Chesapeake Bay. This 'inland wedge'
was eventually recognized as being collocated with the Chesapeake
Bay impact structure, and hypersaline water was detected there at
Kiptopeke, Virginia, at a depth of 0.6 km (Fig. 1). Two original theories
for the origin of the hypersalinity at the crater, evaporite dissolution
and reverse osmosis, have now been dismissed because of
the lack of proximal evaporites and the necessary hydrogeologic
conditions, respectively.

The Chesapeake Bay impact structure was created by a bolide
impact 35 Myr ago in the Late Eocene epoch. Deep drilling by the
US Geological Survey (USGS) at Cape Charles, Virginia, and the
USGS and the International Continental Drilling Program (ICDP) at
Eyreville Neck, Virginia, recovered core and revealed a stratigraphic
section documenting the sediment and bedrock disruption associated
with the 'wet-target' (that is, the location had standing water, usually
the ocean) impact. The test hole at Cape Charles, drilled in 2004,
revealed post-impact sediments underlain by synimpact (that is,
formed by the impact) sediment-clast resurge, crystalline-clast breccias
and suevites down to the bottom of the hole at 823 m. Wells at this
site were installed at depths of 418 m and 692 m. The deep hole at
Eyreville Neck, drilled in 2005, revealed post-impact sediments down
to 444 m, sediment-clast resurge breccias down to 1,096 m, a granite
megablock down to 1,371 m, suevites, cataclasites and crystalline-clast
breccias down to 1,551 m, and schist and pegmatite allochthonous
blocks down to 1,766 m (Fig. 2a). Drilling did not reach the bottom
of the excavated crater. The drill-stem casing was left in the hole down
to 1,330 m, which allowed for later sampling and chemical analysis of
formation water.

Before data were retrieved from deep coring, impact heat and associated
evaporation were suggested as a source of the elevated salinity.
Simulations of the hydrothermal response to the impact suggested that
hydrothermal and vapour phase conditions could exist during and
following the impact. To investigate the thermal history of the crater
and overlying sediments, vitrinite reflectance was measured on sediment
cores from the Eyreville Neck and Cape Charles sites. Results showed
a thermal signature of upward movement of groundwater in the
10,000 years or so following the impact, but a heat balance based on
the suevite temperature and the quantity of hypersaline groundwater
in the crater demonstrated that there was inadequate heat to account
for the excess salinity solely by steam production and loss.
Figure 1 | Map showing locations of Chesapeake Bay crater and coreholes.
Map showing location of the Chesapeake Bay impact structure and sites of
various deep coreholes and wells. The black and red circular lines mark the
outer edges of the inner and outer craters. The black circle marks the location of
the ICDP-USGS deep corehole and well. The red circle marks the location of
the USGS corehole and wells at Cape Charles. The blue circle marks the
location of the USGS corehole and wells at Kiptopeke.
Figure 2 | Chloride and isotope results from the deep ICDP-USGS corehole.
Results from the ICDP-USGS corehole in the Chesapeake Bay impact crater
near Eyreville, Virginia, including drilling depths and sediment types
recovered (a), dissolved chloride concentrations (b), chloride/bromide mass
ratios (c), oxygen isotope measurements of the pore water (d) and dissolved
strontium and dissolved boron isotope measurements (e). Open circles are
⁸⁷Sr/⁸⁶Sr mole ratio results, open squares are boron isotope results. The solid


Pore water was extracted from cores at the Eyreville site over the
entire 1,766 m. Ten-cm-long core samples were taken approximately
every 10-20 m, and water was extracted from the core by high-speed
centrifuge. Analyses were performed to determine the concentrations
of major cations and anions (Extended Data Fig. 1), pH, alkalinity, and
the isotopic compositions of hydrogen (δ²H) and oxygen (δ¹⁸O) of
water, dissolved boron (δ¹¹B) and strontium (⁸⁷Sr/⁸⁶Sr), and sulphur
(δ³⁴S) and oxygen (δ¹⁸O) of dissolved sulphate. The dissolved chloride
concentration at the site increases with depth continuously through the
post-impact section, surpassing the seawater concentration well before
the top of the crater-fill deposits (Fig. 2b). Concentrations continue to
increase erratically with depth in the sediment-clast breccia until
reaching approximately 38 g per litre (twice that of modern sea water)
by 900 m. The Cl/Br mass ratio in pore waters at the site is relatively
constant through the entire section, at a mean value of 262 (that in
modern sea water is 292) (Fig. 2c). This rules out halite dissolution as a
cause of elevated salinity and is indicative rather of an ancient sea
water that is slightly depleted in chloride relative to bromide, consistent
with the vast halite deposits created during the Jurassic and Cretaceous
periods in the Gulf of Mexico and South Atlantic Basins. The δ¹⁸O
values of the pore water at the site increase with depth (Fig. 2d) in a
pattern similar to that of chloride. Strontium isotope ratios are known
to have varied substantially in sea water since the Mesozoic era (Early
Cretaceous ⁸⁷Sr/⁸⁶Sr mole ratios varied between 0.7073 and 0.7075),
and thus might be a good indicator of ancient sea water, but samples
analysed here reflect leaching of strontium from continental-sourced
Potomac Formation sediment clasts hosting the water (Fig. 2e). Boron
isotopic compositions also suggest a strong water-rock interaction,
because δ¹¹B values differ substantially from sea water (11‰) (Fig. 2e)
and are consistent with movement through and interaction with

[Figure 2 axis labels: Cl/Br mass ratio; δ¹⁸O (‰); ⁸⁷Sr/⁸⁶Sr mole ratio; δ¹¹B (‰)]

red curved lines are simulation results assuming values of molecular diffusion
D_m and upward fluid flux q from compaction of 2 × 10⁻¹¹ m² s⁻¹ and
3.5 m Myr⁻¹, respectively. Additional details of the simulation methods can be
found in Methods and Supplementary Information. Each water sample was
analysed only once, and so in all figures each point on the graph represents a
single analysis.
Figure 3 | Helium-based age estimates for the deep Chesapeake crater
groundwater. Estimated groundwater ages in the mid-Atlantic coastal plain of
North America, based on dissolved helium concentrations. Blue diamonds
represent estimates of groundwater ages from wells sampled in the coastal plain
sediments of Maryland. These ages are based on ⁴He-accumulation rates
calibrated against ¹⁴C-based and ³⁶Cl-based ages calculated for a set of wells in
the same geologic formation as the crater wells. Red triangles represent
estimates of age for the groundwater in the Chesapeake Bay impact crater
sampled at the well near Eyreville, Virginia. These latter age estimates are a
linear extrapolation using the ⁴He-accumulation rates calculated from the
Maryland samples.
Figure 4 | Plate-tectonic reconstruction of the ECNA ocean. Plate-tectonic
reconstruction of the North Atlantic Ocean through the Early Cretaceous
epoch at 145 Myr ago (a), 130 Myr ago (b), 115 Myr ago (c) and 100 Myr
ago (d). Red lines represent modern coastlines. Grey shapes represent
continental plates and fragments. The open circle on the Atlantic coast of North
America is the site of the late Eocene (35-Myr-old) Chesapeake Bay impact


igneous rocks in the crater and/or pre-impact crystalline basement 
(Extended Data Fig. 2). 

The curvature of the chloride and δ¹⁸O profiles suggests diffusion and
slow upward advection as controlling processes. One-dimensional
transport simulations were carried out to test this hypothesis. The
advection-diffusion equation was solved with an upper boundary condition
that allowed for post-impact sequences of sediment to be deposited
over time and for the upper boundary salinity to alternate between
that of fresh water and salt water, depending on the inferred position of
the coastline from the known depositional environments and hiatuses
(Extended Data Fig. 3). The simulated chloride values could be made
to match the observed values very closely in the upper 500 m of the
section (Fig. 2) when a diffusion coefficient D_m of 2 × 10⁻¹¹ m² s⁻¹
was used along with an upward fluid flux q of 3.5 m Myr⁻¹ and original
chloride concentrations of 38 g per litre in the sediment-clast breccia
section. Observed variable concentrations in the sediment-clast breccias
lower than those simulated are attributed to clay clasts that are
retaining brackish water. The δ¹⁸O profile was also simulated with
the same diffusion coefficient and upward fluid flux, and initial δ¹⁸O
values (relative to the Vienna Standard Mean Ocean Water (VSMOW)-
Standard Light Antarctic Precipitation (SLAP) standards) of −1.0‰ and
−0.8‰ in the coastal plain and sediment-clast breccias. The simulated
crater (CBIC). The maps are an attempt to delineate the edge of the continents,
and do not show the continental regions that were covered by ocean water
during the global Cretaceous transgression. They were constructed using the
software available at http://www.odsn.de/odsn/services/paleomap/paleomap.html.


δ¹⁸O values also closely match the observed data down to
500 m. Observed and simulated δ³⁷Cl values suggest possible isotopic
effects related to the impact, but are also consistent with an ECNA
origin for the pore water (Extended Data Fig. 4).
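To illustrate the kind of one-dimensional advection-diffusion calculation described above, the following is a deliberately simplified sketch and not the authors' model. It uses the quoted diffusion coefficient (2 × 10⁻¹¹ m² s⁻¹), upward flux (3.5 m Myr⁻¹), 35 Myr duration and 38 g per litre starting concentration, but it assumes a fixed 1,000 m column, a constant freshwater upper boundary (the paper alternates fresh and salt water and adds sediment over time) and treats the flux as a pore velocity. The function name and grid settings are hypothetical.

```python
SECONDS_PER_MYR = 3.15576e13  # one million Julian years in seconds


def simulate_chloride_profile(
    depth_m=1000.0,
    dz_m=10.0,
    diffusion_m2_s=2e-11,
    upward_flux_m_myr=3.5,
    duration_myr=35.0,
    top_conc_g_l=0.0,    # assumed constant freshwater boundary
    deep_conc_g_l=38.0,  # initial and lower-boundary chloride
):
    """Explicit upwind scheme for dC/dt = D d2C/dz2 + q dC/dz (z positive downward)."""
    d_myr = diffusion_m2_s * SECONDS_PER_MYR  # m^2 per Myr
    n = int(round(depth_m / dz_m)) + 1
    # Choose a time step that keeps the explicit scheme stable (2r + c <= 0.4).
    dt = 0.4 * dz_m ** 2 / (2.0 * d_myr + upward_flux_m_myr * dz_m)
    steps = int(duration_myr / dt) + 1
    dt = duration_myr / steps
    r = d_myr * dt / dz_m ** 2
    c = upward_flux_m_myr * dt / dz_m
    conc = [deep_conc_g_l] * n
    conc[0] = top_conc_g_l
    for _ in range(steps):
        new = conc[:]  # boundary cells keep their fixed values
        for i in range(1, n - 1):
            diffusion = r * (conc[i + 1] - 2.0 * conc[i] + conc[i - 1])
            advection = c * (conc[i + 1] - conc[i])  # upward flow brings water from below
            new[i] = conc[i] + diffusion + advection
        conc = new
    depths = [i * dz_m for i in range(n)]
    return depths, conc


if __name__ == "__main__":
    depths, conc = simulate_chloride_profile()
    for z, value in list(zip(depths, conc))[::10]:
        print(f"{z:6.0f} m  {value:5.1f} g/l")
```

Even with these simplifications the sketch yields a smooth profile in which chloride rises with depth towards the deep value, which is the qualitative behaviour the curvature argument relies on.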

The minor inflections in the Cl and δ¹⁸O data can be replicated only
by including sedimentation and temporal salinity changes at the upper
boundary. The overall goodness-of-fit to these simulations (Extended
Data Fig. 5) is evidence that the crater fill has been undisturbed by
lateral groundwater movement since the time of impact, with the
solutes being influenced only by diffusion and advection consistent
with burial and compaction of fine-grained sediment. The initial
condition of 38 g per litre for the chloride concentration simulation
represents water that was trapped in and below the coastal plain sediments
before the bolide impact occurred. Simulation of chloride concentrations
in the pre-impact sediments confirms that most of the pore
water would have remained undisturbed between the end of the Early
Cretaceous period and the time of impact (Extended Data Fig. 6). The
impact disrupted the sediment and basement crystalline rocks, along
with their pore water, redepositing them in the crater. Because much of
the crater fill is composed of clasts many metres in diameter, most of
the initial pore water would have been retained through the collapse and
resurge phases after impact. Vertical pore-fluid flux due to loose-sediment
compaction immediately following the resurge was estimated to be
around 30 m kyr⁻¹ (ref. 18). But the much slower 3.5 m Myr⁻¹ flux
over the intervening 35 Myr indicated by the chloride and δ¹⁸O profiles
is consistent with loading rates during post-impact sedimentation.

Dissolved gas samples were collected from the deep wells at Cape 
Charles and Eyreville. At Eyreville, the well casing in corehole B was 
left in situ at the very top of the granite megablock(s). Helium concentrations
and isotope ratios (R/Rₐ) for deep Cape Charles and
Eyreville groundwater were found to be 5.0 X 10-*cm*g™! at STP 
(standard temperature and pressure) and 4.0 X 10-*cm’g | at STP, 
and 0.44 and 0.04, respectively. The low R/Rₐ value at Eyreville
suggests that this sample's helium concentration represents predominantly
in situ helium accumulation over time, and so we used that
concentration to calculate a He-based groundwater age. From recent
estimates of ⁴He accumulation rates in the coastal plain sediments,
the helium age of the saline groundwater would be 180 ± 80 Myr
(Fig. 3) given the variability in accumulation rates and the extrapolation
to the much older age. The age can be further constrained to the
Early Cretaceous (145-100 Myr ago), because the Lower Cretaceous
Potomac Formation is the oldest local sediment, resting directly on the
crystalline basement under the coastal plain (pages 19 and 20 in ref.
26). Given that the depositional environment of the Lower Cretaceous
Potomac Formation is coastal fluvial and deltaic, the water in the
Lower Cretaceous Potomac Formation and the upper crystalline basement
would have been replaced with Early Cretaceous North Atlantic
(ECNA) sea water during the global Cretaceous transgression.

For the North Atlantic Ocean, the Cretaceous period was a time of 
transition from a closed rift basin to an open ocean (Fig. 4). During the 
Jurassic period, the early North Atlantic must have had a very high 
salinity, evidenced by the fact that it was depositing salt as evaporite 
deposits around its border’’. Rifting that would lead to the South 
Atlantic Ocean was only beginning during the Early Cretaceous. As 
small openings to the ECNA waxed and waned and eventually became 
wider during the Early Cretaceous, the salinity would have fluctuated 
but slowly declined overall from highly saline, eventually reaching a 
near-modern value. Anoxic events recorded in the ECNA suggest a
lack of basin-wide circulation, and some δ¹⁸O evidence from foraminifera
in cores suggests a salinity higher than that of modern sea
water. By the end of the Early Cretaceous, the openings, especially
to the South Atlantic, were becoming substantially wider (Fig. 4d). 
Even a small opening could lead to substantial dilution of salinity. 
As a modern example, the only opening to the Mediterranean Sea is 
the Strait of Gibraltar, yet its salinity (38‰) is only about 10% above
modern ocean salinity. By the beginning of the Late Cretaceous
(100 Myr ago), North Atlantic salinity was probably near that of mod- 
ern sea water. 

We conclude that the saline water currently present in the Chesapeake 
Bay impact crater’s resurge breccia and crystalline blocks is a remnant 
of ECNA sea water, and that the ECNA had an average salinity of about 
70‰, or twice that of modern sea water. In addition, given that several
other locations along the North American Atlantic Coastal Plain have 
salinities greater than that of sea water at depth, we believe remnants of 
ECNA sea water probably persist at many other locations along the 
Atlantic margin as well. 


METHODS SUMMARY 


At the Eyreville ICPD-USGS site, 10-cm-long sections of core were removed every 
10-20 m for pore-water analysis. Most of the unconsolidated cores removed for 
pore-water analysis were bisected, and one half was returned for archival purposes. 
A few millimetres of the outer rind of each sample were removed with a knife to 
remove visible drilling mud from the outside of the core and any that may have 
invaded the outer edge of the core. Core sections were then placed in glass jars that 
had been flushed with nitrogen, sealed and refrigerated at 10 °C. 

The core samples were placed in one of two USGS high-speed centrifuges in 
Reston, Virginia, and Denver, Colorado, to extract pore water. Once extracted, the 
pore-water samples were analysed for pH, alkalinity and specific conductance by
using Hach small-volume test kits. The remaining water was then separated into 
vials for various analyses and refrigerated. Analyses for cations and anions and the
stable isotopes of water were performed in USGS laboratories in Reston, Virginia;
anions and cations were analysed by ion chromatography and inductively coupled
plasma-optical spectroscopy, respectively. Hydrogen and oxygen isotopes of water were measured, respectively, by
gaseous hydrogen and carbon dioxide equilibration and dual-inlet isotope-ratio 
mass spectrometry. Strontium and boron isotope ratios were measured in a 
USGS laboratory in Menlo Park, California, by inductively coupled plasma mass 
spectrometry. 

For helium in the deep wells, water samples were retrieved by using a 6-litre, 
stainless steel Kemmerer sampling flask attached to a portable cable winch fitted 
with a digital depth counter. After being lowered to near the bottom of the well 
casing, the Kemmerer flask was closed by a sender weight and retrieved. Dissolved 
helium and helium isotope ratios were measured in the USGS Noble Gas Laboratory 
in Denver, Colorado, using a capacitance manometer, and a MAP-215-50 mass 
spectrometer, respectively. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


Core sample collection. At the Eyreville ICPD-USGS site, 10-cm-long sections of 
core were removed every 10-20 m for pore-water analysis. Cores from core hole A 
were 8.5 cm in diameter from 126 m to 591 m and 6.35 cm from 592 m to 941 m,
whereas those from core hole B were 6.35 cm from 738 m to 1,101 m and 4.76 cm
from 1,101 m to 1,706 m. Most of the unconsolidated cores removed for pore-water
analysis were bisected, and one half was returned for archival purposes. All
core split samples were collected before the cores were washed. A few millimetres 
of the outer rind of each sample were removed with a knife to remove visible 
drilling mud from the outside of the core and any that might have invaded the 
outer edge of the core. Core sections were then placed in glass jars that had been 
flushed with nitrogen, sealed and refrigerated at 10 °C. The jarred samples were 
then transported and stored at a USGS laboratory in Reston at 5 °C until pore-water
extraction.

Core sample extraction. The core samples were placed in one of two USGS high- 
speed centrifuges in Reston, Virginia, and Denver, Colorado, to extract pore water. 
Samples were removed from the jars and pulverized, and subsamples were spun at 
room temperature at 40,000 revolutions per minute (r.p.m.) for about 30 min. 
Once collected, the pore-water samples were analysed for pH, alkalinity and 
specific conductance by using Hach small-volume test kits. The remaining water
was then separated into vials for various analyses and refrigerated. Water and
solute analyses included major anions and cations, δ²H and δ¹⁸O of water,
⁸⁷Sr/⁸⁶Sr of dissolved strontium, dissolved organic carbon, and δ³⁴S and δ¹⁸O of
sulphate. To ensure the samples were not contaminated by drilling fluids,
selected samples were also analysed for polar-organic compounds that were
characteristic of the drilling fluids. Drilling fluids were not detected in any of the
samples. 

Dissolved ion and isotope methods. Analyses for cations and anions and the 
stable isotopes of water were performed in USGS laboratories in Reston, Virginia. 
The anions and cations were analysed by ion chromatography and inductively 
coupled plasma-optical spectroscopy, respectively. Hydrogen and oxygen isotopes 
of water were measured, respectively, by gaseous hydrogen and carbon dioxide
equilibration and dual-inlet isotope-ratio mass spectrometry. Sulphur and oxygen
isotopes of dissolved sulphate were analysed by conversion of sulphate to
barium sulphate and analysis by continuous-flow isotope-ratio mass spectrometry.
Strontium and boron isotopes were measured in a USGS laboratory in
Menlo Park, California, by thermal ionization mass spectrometry
and negative thermal ionization mass spectrometry, respectively. Chloride stable
isotopes were analysed at the University of Chicago by continuous-flow
isotope-ratio mass spectrometry.

Helium sample collection. In the deep wells at both Eyreville and Cape Charles, 
water samples were retrieved by using a 6-litre, stainless steel Kemmerer sampling 
flask attached to a portable cable winch fitted with a digital depth counter. The 
deep wells were first purged to ensure that pristine formation water was in the well 
casing. After being lowered to near the bottom of the well casing, the Kemmerer 
flask was closed by a sender weight and retrieved. The dissolved gas samples were 
then transferred to a 0.30-m length of 1-cm-diameter copper tubing, first allowing 
the water sample to flow from the Kemmerer sampler through the tubing. Once 
gas bubbles were purged from the tubing, it was crimped on both ends and sealed 
with refrigerator clamps. 

Helium measurement methods. Dissolved helium and helium isotope ratios 
were measured in the USGS Noble Gas Laboratory in Denver, Colorado. The 
dissolved gases were separated from the formation water in the laboratory and 
analysed in an ultrahigh-vacuum extraction system. The total pressure of the 
extracted gases was measured on a capacitance manometer. Helium was separated 
from the sample gas using a cryogenic cold trap and analysed for isotopic
composition statically on a MAP-215-50 mass spectrometer.

Helium data interpretation. Helium accumulation rates were not estimated or 
calibrated in this study, but were calibrated against ¹⁴C and ³⁶Cl abundances in
another recent study in the near vicinity. The calibration results from that recent
study allowed us to make an estimate of the age using helium. We acknowledge 
that the estimate is a large extrapolation from the measured ages and thus repre- 
sents an estimate that is far from precise (our error bars are 80 Myr). The validity of 
the extrapolation is supported, however, by the fact that the helium from the 
calibration study was extracted and measured from a section of the Potomac 
Formation very similar to that from which the helium in this study was extracted. 
In this respect, ‘deep crustal’ contributions of helium that might be present would 
have contributed to the calibrated and extrapolated He values in a similar fashion. 
As additional evidence, deep sources of helium often have a mantle (³He)
signature. We measured ³He/⁴He ratios (R/Ra) (reported in the main text of this
article) and found the value at the deep drill site to be close to zero, suggesting that
the helium was solely from crustal in situ decay, with a negligible mantle source. In
contrast, the helium at the centre of the crater had a value of R/Ra substantially
above zero, perhaps influenced by a deeper source originating through deep crustal
fractures that were generated during the impact. In general, we consider the 
helium-based age to be one piece of circumstantial evidence in this study. It is 
not proof of the age, but it is consistent with the other pieces of evidence. The age is 
further constrained to be not older than Early Cretaceous (by the age of the 
sediment) and not younger (because the North Atlantic then became hydrologi- 
cally open). 
Strontium and boron isotopes. Rock-water interaction is indicated by observing 
a combination of the dissolved strontium and boron isotopes (Extended Data 
Fig. 2). The values do not represent modern sea water, especially not boron
(δ¹¹B), which has a modern seawater value of approximately +40 per mil relative
to NIST SRM 951 boric acid. The data do trend, however, in the direction of
water-rock interaction with igneous or igneous-derived sediments, both of which have
typical values that could easily fall in the range indicated by ‘Pre-impact basement 
igneous rock’ in Extended Data Fig. 2. Mixing with modern sea water is also 
suggested by the isotopes, which is consistent with the 35 million years of diffusion 
that has occurred between the ECNA and modern sea water in the crater. 
Concentration profile simulation. The process of molecular diffusion and upward 
advection of groundwater was simulated by solving the advection-diffusion
equation for the flow of groundwater in a porous medium:

$$\frac{\partial C}{\partial t} = D_m \frac{\partial^2 C}{\partial z^2} - \frac{q}{\varphi}\frac{\partial C}{\partial z} \qquad (1)$$

where D_m is the coefficient of molecular diffusion in dimensional units of [L²/T]; C
is the concentration in [M/L³]; z is the vertical distance in [L]; q is the specific
discharge in [L/T]; φ is porosity; and L, T and M are the dimensions of length, time
and mass, respectively. For the simulation of δ³⁷Cl profiles (see Supplementary
Information), equation (1) must be solved for both ³⁵Cl and ³⁷Cl and the δ³⁷Cl
values calculated from the two results.
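
The last step, turning two simulated isotope profiles into δ³⁷Cl values, can be sketched with the standard delta notation, δ = (R_sample/R_standard − 1) × 1000, where R is the ³⁷Cl/³⁵Cl ratio. The function name is hypothetical and the reference ratio is passed in by the caller; the value 0.32 used in the example is only the approximate natural-abundance ratio and is an assumption, not a figure from this study.

```python
import numpy as np


def delta37cl(c35, c37, r_standard):
    """Return delta-37Cl in per mil from separately simulated profiles.

    c35, c37: concentrations of 35Cl and 37Cl at the same depths.
    r_standard: 37Cl/35Cl ratio of the reference (SMOC), supplied by the caller.
    """
    c35 = np.asarray(c35, dtype=float)
    c37 = np.asarray(c37, dtype=float)
    ratio = c37 / c35
    return (ratio / r_standard - 1.0) * 1000.0


# Example with an assumed reference ratio of about 0.32 (illustrative only).
R_STD = 0.32
example = delta37cl([1.0, 1.0], [R_STD, R_STD * 1.001], R_STD)
```

Because the lighter isotope diffuses slightly faster, the two profiles drift apart over time, and the delta value records that small difference in ratio.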

Equation (1) was solved for the vertical concentration profiles of chloride and 
δ¹⁸O in the crater from the land surface down to a depth of 1,400 m (Extended
Data Fig. 3). A finite difference approximation method was used with a spatial 
discretization of 15 m, and time steps of 25,000 years. The finite difference equa- 
tion was solved in space and time explicitly using an Excel spreadsheet. The upper 
boundary condition represented the top of the sediment column as it accumulated 
over 35 million years since impact. The chloride and δ¹⁸O boundary conditions
at the top of the profile were allowed to switch back and forth between freshwater 
and seawater conditions, depending on the environment at the time. The upward 
advective fluid flux q represents the upward movement of pore water being slowly 
expelled by compaction of the entire sediment column over time. In reality, the 
fluid flux would have varied with depth and over time, but here a single value was 
used as a simplifying assumption. This assumption was adequate to fit the data, 
and the additional complexity of variations in q was not justified here, given the 
scatter in the data that were being fitted. The evolving concentration profiles are 
shown in Extended Data Fig. 3. Porosity values were assigned that approximated 
those from the cores: 0.45 from 0 m to 230 m, 0.60 from 225 m to 345 m, 0.45
from 345 m to 445 m and 0.3 from 445 m to 1,400 m. As with the fluid flux, the
porosity varied more with depth and over time, but this approximation was 
adequate to fit the data. 
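
The explicit scheme described above can be sketched as follows. This is a simplified illustration in Python with NumPy, not the spreadsheet used in the study: the column has a fixed thickness (no sediment accumulation), the top boundary is held constant, concentrations are normalized, and the 225-230 m porosity overlap is resolved at 230 m. The function names are hypothetical, and the diffusion coefficient in the example (1 × 10⁻¹⁰ m² s⁻¹) is an assumed value chosen to keep the explicit scheme stable, not the best-fit value.

```python
import numpy as np

SECONDS_PER_YEAR = 365.25 * 24 * 3600.0


def porosity_at_depth(depth_m):
    """Piecewise porosity from the text (overlap at 225-230 m resolved at 230 m)."""
    depth_m = np.asarray(depth_m, dtype=float)
    return np.select(
        [depth_m < 230.0, depth_m < 345.0, depth_m < 445.0],
        [0.45, 0.60, 0.45],
        default=0.30,
    )


def simulate_profile(c_initial, c_top, c_bottom, d_m, q, porosity,
                     dz=15.0, dt_years=25_000.0, n_steps=1400):
    """Explicit finite-difference solution of equation (1).

    Index 0 is the base of the column and z increases upward, so a
    non-negative q carries solute toward the last node (the surface).
    Units: d_m in m^2/s, q in m/s, dz in m.
    """
    dt = dt_years * SECONDS_PER_YEAR
    r = d_m * dt / dz ** 2              # diffusion number
    cfl = (q / porosity) * dt / dz      # Courant number at each node
    if 2.0 * r + cfl.max() > 1.0:
        raise ValueError("explicit scheme unstable for this dz, dt, d_m and q")

    c = np.array(c_initial, dtype=float)
    c[0], c[-1] = c_bottom, c_top
    for _ in range(n_steps):
        diffusion = r * (c[2:] - 2.0 * c[1:-1] + c[:-2])
        advection = -cfl[1:-1] * (c[1:-1] - c[:-2])  # upwind difference
        c[1:-1] += diffusion + advection
        c[0], c[-1] = c_bottom, c_top               # fixed boundaries
    return c


# Example: 35 Myr in 25,000-year steps on a 15-m grid.
z = np.arange(0.0, 1400.0, 15.0)            # z = 0 at the base of the column
phi = porosity_at_depth(z[-1] - z)          # convert height to depth
q = 3.5 / (1.0e6 * SECONDS_PER_YEAR)        # 3.5 m/Myr expressed in m/s
profile = simulate_profile(np.ones(z.size), c_top=0.0, c_bottom=1.0,
                           d_m=1.0e-10, q=q, porosity=phi)
```

The stability check matters for these step sizes: with a 15-m grid and 25,000-year time steps, an explicit scheme only remains stable while 2·D_m·Δt/Δz² plus the Courant number stays at or below one.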

The molecular diffusion coefficient and fluid flux value q were adjusted until a 
best fit was made with both profiles. The best-fit values and profiles for chloride 
and δ¹⁸O are shown in Extended Data Fig. 5a, e, respectively. The uncertainty in
these values was investigated by varying the boundary conditions and parameter 
values (Extended Data Fig. 5). Simulations without the transient change in
concentration at the upper boundary, without the sedimentation history, or
without either, show poorer fits.
The plots shown include adjusting the values of D_m and q to obtain the best fit
possible. The values of D_m and q were also adjusted upward and downward by
factors of two, to show the sensitivity of the fit to the parameter. The sum of 
squared errors is listed for all simulations, and increases substantially for all of 
the alternative simulation conditions other than the global best-fit values. 
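
The goodness-of-fit measure and the factor-of-two sensitivity cases can be sketched as below. The function names and the dictionary labels are hypothetical. The sketch only builds the halved and doubled parameter pairs; it does not readjust the other parameter to a new best fit, which the uncertainty analysis does for each case.

```python
import numpy as np


def sum_squared_errors(observed, simulated, used=None):
    """Sum of squared residuals over the data points flagged as used."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    if used is None:
        used = np.ones(observed.shape, dtype=bool)
    used = np.asarray(used, dtype=bool)
    residuals = observed[used] - simulated[used]
    return float(np.sum(residuals ** 2))


def factor_of_two_cases(d_m, q):
    """Return the best-fit pair plus halved and doubled variants of each parameter."""
    return {
        "best fit": (d_m, q),
        "half D_m": (0.5 * d_m, q),
        "double D_m": (2.0 * d_m, q),
        "half q": (d_m, 0.5 * q),
        "double q": (d_m, 2.0 * q),
    }
```

Each parameter pair would be passed to the transport simulation and the resulting profile compared with the core data through `sum_squared_errors`; a clear rise in the sum for every variant indicates that the fit is sensitive to that parameter.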
Extended Data Figure 1 | Results from major cation and anion analyses.
Results from the analyses of major cation and anion concentrations and their
ratios in the crater corehole pore waters. Chloride and bromide results are
given in Fig. 2. The vertical dashed black lines represent concentrations or
values for modern sea water. For additional discussion see Supplementary
Information.
Extended Data Figure 3 | Detailed results from chloride and ¹⁸O transport
simulations. Results from the numerical simulation of chloride and δ¹⁸O
diffusion and vertical advection. Shown are the transient change in the top
model boundary condition over time, including the upward migration of the
sediment-water interface and the changes in chlorinity (a), the simulated …
change in δ¹⁸O abundances relative to VSMOW (Vienna Standard Mean
Ocean Water) with depth and time (c). Red lines are simulated values, and black
diamonds shown in the 1-Myr and 0-Myr panels are observation data from
cores for comparison. Simulations use the best-fit parameter values of
D_m = 2 × 10^-* m² s⁻¹ and q = 3.5 m Myr⁻¹. For additional discussion see …
Extended Data Figure 4 | Stable chloride isotope results. a, Major lithologies
drilled at the Chesapeake Bay impact crater, along with observed δ³⁷Cl data and
results of numerical simulations of δ³⁷Cl relative to SMOC (standard mean
ocean chloride) using the best-fit parameters of the chloride and δ¹⁸O
advection-diffusion simulations (Fig. 2). Two conceptual models were
investigated to explain the observed data, including isotopic fractionation using
… Supplementary Information.
Extended Data Figure 5 | Uncertainty analysis for the chloride and ¹⁸O
transport simulations. Plots showing model uncertainty relative to the most
realistic boundary conditions and global best-fit parameter values for Cl (a) and
δ¹⁸O (e). Alternative boundary conditions include no sedimentation history
(b, f), no transient change in the upper boundary salinity (c, g), and neither of
these (d, h) for both chloride (b-d) and δ¹⁸O (f-h). Variations for the best-fit
parameters include half (i, k, m, o) and twice (j, l, n, p) the best-fit values for
both the coefficient of molecular diffusion (D_m) (i-l) and the magnitude of the
upward fluid flux q (m-p). Filled and open circles represent those data points
used in the sum-of-squared-errors (SSE) calculation, and those not used,
respectively. Solid lines are simulation results. For plots i-p one of the two
parameters is reset, and the other readjusted to a new best fit. For additional
discussion see Methods.
Extended Data Figure 6 | Simulation of pre-impact chloride profile. 
Simulated chloride concentrations through the pre-impact coastal plain 
sediments. The black, red and blue lines indicate chloride concentrations at the 
end of the Early Cretaceous, at the end of the Cretaceous Period and at the time 
of the bolide impact, respectively. For additional discussion see Supplementary 
Information. 
The earliest known holometabolous insects 


André Nel, Patrick Roques, Patricia Nel,
Laure Desutter-Grandcolas, Torsten Wappler, Romain Garrouste, David Coty, Diying Huang, Michael S. Engel
& Alexander G. Kirejtshuk


The Eumetabola (Endopterygota (also known as Holometabola) plus
Paraneoptera) have the highest number of species of any clade, and
greatly contribute to animal species biodiversity. The palaeoecological
circumstances that favoured their emergence and success
remain an intriguing question. Recent molecular phylogenetic
analyses have suggested a wide range of dates for the initial appearance
of the Holometabola, from the Middle Devonian epoch (391
million years (Myr) ago) to the Late Pennsylvanian epoch (311 Myr
ago), and a date of 310 Myr ago for the Hemiptera. Palaeoenvironments greatly
changed over these periods, with global cooling and increasing complexity
of green forests. The Pennsylvanian-period crown-eumetabolan
fossil record remains notably incomplete, particularly as several
fossils have been erroneously considered to be stem Holometabola
(Supplementary Information); the earliest definitive beetles are from
the start of the Permian period. The emergence of the hymenopterids,
sister group to other Holometabola, is dated between 350 and
309 Myr ago, incongruent with their current earliest record (Middle
Triassic epoch). Here we describe five fossils: a Gzhelian-age stem
coleopterid, and a holometabolous larva of uncertain ordinal affinity, a
stem hymenopterid, and early Hemiptera and Psocodea, all from
the Moscovian age. They reveal a notable penecontemporaneous
breadth of early eumetabolan insects. These discoveries are more
congruent with current hypotheses of clade divergence. Eumetabola
experienced episodes of diversification during the Bashkirian-
Moscovian and the Kasimovian-Gzhelian ages. This cladogenetic
activity is perhaps related to notable episodes of drying resulting
from glaciations, leading to the eventual demise in Euramerica of
coal-swamp ecosystems, evidenced by floral turnover during this
interval. These ancient species were of very small size, living in
the shadow of Palaeozoic-era ‘giant’ insects. Although these discoveries
reveal unexpected Pennsylvanian eumetabolan diversity, the
lineage radiated more successfully only after the mass extinctions
at the end of the Permian period, giving rise to the familiar crown
groups of their respective clades.

Newly discovered species permit a new inventory of the Pennsylvanian 
Eumetabola. These small insects are scarcely visible against the dark 
matrix, explaining why they were hitherto generally overlooked. The lack 
of fossil insects between the Late Devonian and the early Serpukhovian 
age (Hexapoda gap) could be due in part to a failure to look for such
diminutive remains, with most researchers hunting for the more famil- 
iar ‘giants’ of these periods. Informative descriptions and details on 
affinities of the new taxa are given in the Supplementary Information 
and Methods. 


Clade Holometabola 
Stem group to the hymenopterids 
Avioxyelidae fam. nov. 
Avioxyela gallica gen. et sp. nov. 


Referred material. Holotype Avion no. 3 (Avion, Pas-de-Calais, France, 
Moscovian) (Supplementary Information). 
Etymology. The genus name is a combination of the type locality 
Avion and the ‘primitive’ wasp genus Xyela. The name is feminine. 
The specific epithet refers to the ancient Roman name for France. 
Avioxyela is type genus for Avioxyelidae and A. gallica is type species 
for Avioxyela. 
Diagnosis. Forewing characters only. Subcosta posterior zigzagged and 
short, with cross-veins between it and the cubitus and radius; a very 
large ‘pterostigmal’ cell between branches of radius anterior (autapo- 
morphy); radius anterior 1 (RA1) and RA2 perpendicular to RA3 and 
RA4, and to anterior wing margin; radius posterior emerging from 
radius posterior and median posterior at right angle; radius posterior, 
median anterior and median posterior simple more or less parallel, 
with few large rectangular cells between them; a very small triangular 
cell just distal of the pterostigmal cell and a triangular radial cell near 
wing apex (synapomorphies with the hymenopterid lineage); cubitus
anterior long, independent of median vein, long cubitus posterior with 
several posterior branches and one cell between it and the posterior 
wing margin (Fig. 1a and Supplementary Figs 1 and 2).
Stem group to the coleopterids 
Stephanastidae fam. nov. 
Stephanastus polinae gen. et sp. nov. 
Referred material. Holotype MNHN.A.49011 (Gzhelian, Commentry, 
Allier, France). 
Etymology. The genus name is a reference to the Stephanian stage 
(also known as Gzhelian, originally named after Saint Stephen) and 
‘fellow citizen’ in Greek (‘astos’: astos or aste belongs to the astu (settle- 
ment) and likewise the polites or politis). The name is masculine. The 
species epithet is a matronym honouring Polina G. Kirejtshuk, daughter 
of one of the authors of the manuscript. Stephanastus is type genus for 
Stephanastidae and S. polinae is type species for Stephanastus. 
Diagnosis. Forewing subcosta simple; radius posterior and median 
simple; concave median and convex cubitus anterior with a common 
stem separated from radius (synapomorphies with the coleopterids); 
very long cubitus posterior reaching distal eighth of forewing; anal veins 
very short at base of forewing (autapomorphies); forewings sclerotized 
and longer than abdomen, with very fine and dense isodiametric cells 
forming regular rows generating intercalary veins along anterior edge, 
more diffuse near posterior margin; no adsutural line (sub-marginal 
anal vein along the whole posterior edge), medial edge of forewings not 
forming straight adjoining margins, apices not meeting apically; very 
large area between cubitus and radius; coxae not projecting, trochan- 
ters apparently short; lateral prothoracic carina absent (no prohypo- 
mera) (Fig. 1b-d and Supplementary Fig. 3). 
Clade Holometabola 
Metabolarva bella gen. et sp. nov. 
Figure 1 | Representatives of stem hymenopterids and stem coleopterids.
a, Avioxyela gallica gen. et sp. nov. imprint. b-d, Stephanastus polinae gen. et sp.
nov. b, General habitus (white and black arrows correspond to the veins of the
two forewings). c, Reconstruction of habitus (numerous intercalary
longitudinal pseudo-veins removed for clarity). d, Enlargement of the elytra
base (black box in b). A, anal vein; C, cubitus; CuA, cubitus anterior; CuP,
cubitus posterior; cua-cup, cubital brace; M, median; PC, precosta; R, radius;
RA, radius anterior; RP, radius posterior; ScP, subcosta posterior. Scale bars,
1 mm (a), 10 mm (b).


Referred material. Holotype MAS Pal 588 (Moscovian, Piesberg, 
Lower Saxony, Germany). 
Etymology. The name of the new genus is formed from the term 
‘metabolism’ and Latin ‘larva’ (immature active stage of ontogenesis). 
The name is feminine. The species name bella is in reference to the 
good state of preservation. Metabolarva bella is type species of the 
genus Metabolarva. 
Diagnosis. Larviform animal of eruciform oligopod type; prothorax 
with paired wide sclerotized plates narrowly separated along midline; 
abdomen membranous, apparently with eight segments; probable stig- 
mal plate bearing a pair of spiracular orifices; distal preapical abdomi- 
nal segments bearing in total four pairs of thick appendages forked into 
two (anterior and posterior) processes (Fig. 2a, b, and Supplementary 
Fig. 4). 
Clade Paraneoptera 
Order Hemiptera 
Family Aviorrhynchidae fam. nov. 
Aviorrhyncha magnifica gen. et sp. nov. 
Referred material. Holotype Avion no. 2 (Avion, Pas-de-Calais, France, 
Moscovian) (Supplementary Information). 
Etymology. The generic name refers to the type locality Avion, and 
rrhyncha refers to the common suffix of related suborders (for example, 
Sternorrhyncha, Auchenorrhyncha). The name is feminine. The species 
epithet refers to the wonderful state of preservation of the holotype. 

Figure 2 | Holometabolous larva and representatives of stem Psocodea and 
Euhemiptera. a, b, Metabolarva bella gen. et sp. nov. a, Photograph. 

b, Reconstruction of habitus. c-e, Aviorrhyncha magnifica gen. et sp. nov. 

c, Reconstruction of forewing. —, concave veins; AA1 + 2, first anal anterior 
vein; AA3 + 4, second anal anterior vein. Short lines outside of the wing 
illustration mark the limits of different veins along the wing margin. 

d, Detail of wing base. e, Detail of anterior margin. f, Westphalopsocus pumilio 
gen. et sp. nov., forewing. PC, precosta. Scale bars, 1 mm (a, c, f).


Aviorrhyncha is type genus for Aviorrhynchidae and A. magnifica is 
type species for Aviorrhyncha. 

Diagnosis. Forewing with a cua-cup vein, long concave precosta, very 
short concave subcosta posterior fused with radius near wing base, re- 
emerging distally, a fringed ambient vein (synapomorphies with Para- 
neoptera or Euhemiptera); a series of veinlets between radius and anterior 
ambient vein; areola postica with long simple branches; simple cubitus 
posterior and first anterior anal (AA1 + 2) fused near wing base and 
separating again distally (Fig. 2c-e and Supplementary Figs 5 and 6a). 

Order Psocodea 
Family Westphalopsocidae fam. nov. 
Westphalopsocus pumilio gen. et sp. nov. 


Referred material. Holotype Avion no. 4 (Avion, Pas-de-Calais, France, 
Moscovian) (Supplementary Information). 

Etymology. The generic name refers to the Westphalian stage and Psocus, 
common stem of many generic names in the Psocodea. The name is 
masculine. The species epithet refers to the very small size of the species. 
Westphalopsocus is type genus for Westphalopsocidae and W. pumilio 
is type species for Westphalopsocus. 

Diagnosis. Forewing narrow elongate with parallel margins, with a cua- 
cup cross-vein; two anal veins fused apically; cubitus anterior forming an 
areola postica; subcosta posterior distally fused with radius anterior and 
re-emerging near its apex; a pterostigmal cell (characters of Psocodea) 
between apices of subcosta posterior and radius anterior; a series of cross-
veins between subcosta posterior and cubitus (autapomorphy); anterior
branch of radius posterior emitting four short anterior branches; three 
posterior median branches; cubitus posterior simple; two anal veins 
coalescing and fusing apically for a distance (Fig. 2f and Supplementary
Fig. 7). 

These taxa alter current views on insect diversity during the Pennsyl- 
vanian, a critical period in hexapod diversification (Fig. 3 and Table 1). 
Avioxyela gallica is the earliest and first Carboniferous species that can 
be assigned to the stem lineage of the hymenopterids, one of the three 
main ecologically ubiquitous insect clades. The approximately 90-Myr 
gap between this fossil and the earliest crown hymenopterid remains 
incongruent with the molecular dating of all principal hymenopteran 
clades from late in the Palaeozoic and early in the Mesozoic era. Molecular
studies have failed to account for some Palaeozoic ‘mecopteroids’,
which are probably more closely allied to Avioxyela and the hymenopterids.
The previous discovery of the Bashkirian panorpidan Westphalomerope
indicated that the stem Neuropteroidea (the coleopterids plus Neuropterida)
is of coeval age. True Neuropterida and definitive tshekardocoleid
beetles are known in the lowermost Permian. The Gzhelian
Stephanastus is the oldest known representative of the stem group to the 
coleopterids (Coleoptera plus Strepsiptera), demonstrating that the 
Neuropterida are at least latest Pennsylvanian in age. The crown group 
Hymenoptera (common ancestor of Xyelidae and Neohymenoptera) 
perhaps originated during the Permian-Triassic from a grade of stem 
hymenopterids such as Avioxyela. Earlier alleged stem groups to the 
Holometabola are actually Polyneoptera (Supplementary Information), 
a group that is already well represented in the Pennsylvanian. The key 
innovation of the holometabolan larva was probably among the factors 
that purportedly fuelled their diversification. Such larvae must have 
already been present during the Pennsylvanian as evidenced by the 
presence of definitive holometabolans described herein. Given that 
the putative Moscovian larva Srokalarva berthei remains to be formally
described (see Supplementary Information), the Moscovian Metabolarva
presented here is the oldest direct record of a holometabolous larva. It is
likely to have had a metapneustic respiratory system with spiracular 
orifices at the anal end of the body, and to have lived in a semi-aquatic 
or moist environment. This eruciform larva is in accordance with the 
idea that this larval type may represent the groundplan for Holometabola, 
rather than the campodeiform type proposed earlier.

Pennsylvanian Paraneoptera are currently represented by the 
Serpukhovian hemipteran Protoprosbole, the Moscovian thripidan 
Westphalothripides, and the stem paraneopteran Bruayaphis. The
Moscovian Westphalopsocus therefore represents the oldest evidence
of Psocodea, extending the lineage back from earlier records based on
the Permian Zygopsocus. The Moscovian Aviorrhyncha is the second
Pennsylvanian hemipteran. Its proposed attribution to the Euhemiptera
implies a separation from the Sternorrhyncha dating back at least to the
Moscovian, older than suggested by molecular dating.

The new fossil material highlights the diversity of Pennsylvanian 
Eumetabola, involving stem groups to the major lineages Hymenopterida, 
Panorpida, Neuropteroidea, Thripida, Hemiptera and Psocodea. These 
discoveries are congruent with recent attempts to estimate divergence 
times based on sequence data, although those studies have widely failed 
to distinguish between the divergences of the stem groups, the crown 
groups of which may have appeared at different times and much later. 
These data are congruent with earlier evidence suggesting that the 
initial phases of the diversification of principal insect lineages are much
older than previously surmised. Taking into account the presence of
taxa such as Rhyniognatha hirsti in the Pragian, the data seem to indicate
that insects are likely to have originated around the Silurian-
Devonian boundary and diversified between the Devonian and the 
early Pennsylvanian. The data suggest that the foundations of the even- 
tually hyperdiverse Holometabola, comprising most modern-day insect 
species, were already well established in the Pennsylvanian. 

The oldest Permian beetles and the earliest Devonian Pterygota were 
very small, considerably smaller than many of their descendants”. 
Except for the Gzhelian Stephanastus, all of the Pennsylvanian adult 
Figure 3 | Phylogeny of main insect clades. Strudiella devonica Garrouste 
et al., 2012 (ref. 25) (1); wing of an archaeorthopteran, Prokop et al., 2005 (2); 
Westphalopsocus pumilio gen. et sp. nov. (3); Protoprosbole straeleni 
Laurentiaux, 1952 (4); Aviorrhyncha magnifica gen. et sp. nov. (5); 
Westphalothripides oudardi Nel et al., 2012 (ref. 15) (6); Avioxyela gallica 
gen. et sp. nov. (7); Metabolarva bella gen. et sp. nov. (8); Srokalarva berthei 
Kukalova-Peck, 1997 (9); Pteridotorichnos stipitopteri Labandeira and Phillips, 
1996 (10); Westphalomerope maryvonneae Nel et al., 2007 (ref. 16) (11); 
undescribed Neuroptera of Obora (Czech Republic) (12); Stephanastus polinae 
gen. et sp. nov. (13); Coleopsis archaica Kirejtshuk et al., 2013 (ref. 22) (14). 
Bashk., Bashkirian; Gzh., Gzhelian; Kas., Kasimovian; Mosc., Moscovian; 
Serpukh., Serpukhovian. 
Table 1 | Carboniferous Paraneoptera and Holometabola 

Clade                     Genus               Fossil           Length          Stratigraphic stage  Locality                    Ref. 
Hemiptera                 Protoprosbole       Wing             14 mm           Serpukhovian         Forchie, Belgium            15 
Hemiptera                 Aviorrhyncha        Wing             7.5 mm          Moscovian            Avion, France               Present paper 
Psocodea                  Westphalopsocus     Wing             3.6 mm          Moscovian            Avion, France               Present paper 
Paraneoptera              Bruayaphis          Wing             3.8 mm          Moscovian            Bruay-en-Artois, France     15 
Thripida                  Westphalothripides  Wing             7.4 mm          Moscovian            Avion, France               17 
Holometabola (uncertain)  Baryshnyala         Wing             9.6 mm          Bashkirian           Hagen-Vorhalle, Germany     19 
Panorpida                 Westphalomerope     Wing             4.8 mm          Bashkirian           Bruay-la-Buissière, France  16 
Stem hymenopterid         Avioxyela           Wing             5.5 mm          Moscovian            Avion, France               Present paper 
Holometabola              Srokalarva          Larva            30 mm           Moscovian            Mazon Creek, USA            17 
Holometabola              Metabolarva         Larva            9.1 mm          Moscovian            Osnabrück, Germany          Present paper 
Holometabola              Pteridotorichnos    Gall             5 mm            Kasimovian           Illinois basin, USA         18 
Stem coleopterid          Stephanastus        Body with wings  25.5 mm (wing)  Gzhelian             Commentry, France           Present paper 


Eumetabola were also distinctively miniaturized, with wing lengths 
ranging between 3.8 mm and 14.0 mm (a span of 10.2 mm), in contrast to 
120 mm for Recent representatives (see Fig. 4). It would seem that these 
clades followed Cope’s rule”””’; that is, the tendency for organisms in 
certain lineages to increase in size over time. In addition, the Serpu- 
khovian Eugeropteridae, the most basal clade of the Odonatoptera, 
were of relatively small proportions, with wing lengths ranging between 
35 and 45 mm, whereas the ‘giant’ Meganeuridae (griffinflies) included 
the largest of all known insects with wing lengths between 50 and 
330 mm. These facts contradict the findings in a previous paper” in 
which it was supposed that this rule was not obeyed by insects of the 
Pennsylvanian. Although modern eumetabolous and non-eumetabolous 
insects have similarly smaller ranges of body sizes, this was not the case 
during the Pennsylvanian, as the known non-eumetabolous insects 
had wing lengths ranging between approximately 10 mm and 330 mm 
(Fig. 4). The presence of small, non-eumetabolous insects in the Mosco- 
vian of Avion (Supplementary Fig. 8) shows that their absence from 
other localities may be an artefact of biased collecting towards larger, 
showier hexapods. Conversely, the absence of larger Eumetabola during 
the Pennsylvanian is assuredly not an artefact. Only a few palaeopteran 
lineages were affected by gigantism, suggesting that this phenomenon 
Figure 4 | Comparison of the sizes of Carboniferous fossil Pterygota and 
extant Pterygota. Illustrations of insects demonstrate the different clades 
(two are given for Palaeoptera and Polyneoptera of the Carboniferous, to show 
their diversity); their sizes are used to show approximately which are larger than 
others, but they are not to scale. Smallest extant insects are 0.25 mm long. 
Smallest Carboniferous wings are 3.8 mm long. 
was not solely due to the Pennsylvanian hyperoxic conditions that 
should have affected all clades”. It could be assumed that a combination 
of palaeoecological, physiological and environmental factors stimu- 
lated the growth of ‘giant’ insects among certain clades and the mini- 
aturization of the Eumetabola during the time interval. 

The ‘key innovation’ of metamorphosis, present in holometabolan 
larvae and some Paraneoptera, may have fostered the longevity of 
these taxa during the Pennsylvanian climatic oscillations” and the 
later Permian-Triassic extinction event®. The Holometabola, and the 
Paraneoptera to a lesser degree, experienced an extraordinary Triassic 
diversification, after the extinction of several ecologically dominant 
palaeopterous groups at the end of the Permian. Although holometa- 
bolans comprise some of the largest modern insects, their ancient proge- 
nitors apparently were very small. Much like the early, tiny mammals 
that skirted dinosaurian diversity many eons later, the Pennsylvanian 
Holometabola were dwarfed by but existed alongside the more ubiquitous 
and famous palaeopteran and polyneopteran insects of that bygone age’®. 


METHODS SUMMARY 


It was necessary to observe each small piece of rock under a stereomicroscope to 
detect the insect fossils. The specimens are provisionally housed at the Muséum 
National d’Histoire Naturelle, Paris, France, and were prepared using sharp spines. 
Photographs were taken with a Canon D550 digital camera with reverse lens MP-E 
65 mm, with dry specimens, and line drawings were prepared using a camera lucida 
on an Olympus SZX-9 stereomicroscope. Original photographs were processed 
using the image-editing software Adobe Photoshop CS. Some specimens were 
examined with a Tescan Vega LSU scanning electron microscope. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


It was necessary to observe each small piece of rock under a stereomicroscope to 
detect the insect fossils. The specimens are provisionally housed at the Muséum 
National d’Histoire Naturelle (MNHN), Paris, France, and were prepared using 
sharp spines. Photographs were taken with a Canon D550 digital camera with 
reverse lens MP-E 65 mm, with dry specimens, and line drawings were prepared 
using a camera lucida on an Olympus SZX-9 stereomicroscope. Original photo- 
graphs were processed using the image-editing software Adobe Photoshop CS. 
Some specimens were examined with a Tescan Vega LSU scanning electron microscope. 
Fossils from the Avion outcrop (Moscovian, Department of Pas-de-Calais, 
France) are in the P. Roques collection, currently stored in the Entomological 
Laboratory, MNHN, Paris, and to be deposited in the Musée Géologique Pierre 
Vetter, Decazeville, France. Standard wing venation nomenclature is followed 
for Hymenoptera’®, Coleoptera” and Paraneoptera’*. The diagnoses given in the 
main text are the same for the familial, generic and specific levels because all are 
presently monospecific. The ZooBank LSIDs (Life Science Identifiers) for the new 
genera and species are as follows: 
Avioxyela gallica LSID, urn:lsid:zoobank.org:act:A9759D8D-785F-472B-A99A-9FE31622CCBC; 
Stephanastus polinae LSID, urn:lsid:zoobank.org:act:A75F53A2-0BE2-425D-97B4-33A572F20D73; 
Metabolarva bella LSID, urn:lsid:zoobank.org:act:7D1BE021-C4C3-4760-B8C7-6099454C2A8F; 
Aviorrhyncha magnifica LSID, urn:lsid:zoobank.org:act:41D4931A-57A9-42FD-8F01-7F909906782E; 
Westphalopsocus pumilio LSID, urn:lsid:zoobank.org:act:68C4E20D-1472-49FD-9826-24CE88A0CACE. 
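
The identifiers listed here share a fixed pattern: the prefix urn:lsid:zoobank.org, a namespace (act for the nomenclatural acts registered here) and a UUID. The following Python is a minimal sketch that checks only this pattern; it does not query ZooBank. The names LSID_RE, parse_zoobank_lsid and NEW_TAXA are hypothetical and are not part of any ZooBank tool.

```python
import re
import uuid

# Assumed pattern: urn:lsid:zoobank.org:<namespace>:<UUID>.
LSID_RE = re.compile(
    r"^urn:lsid:zoobank\.org:([a-z]+):"
    r"([0-9A-Fa-f]{8}(?:-[0-9A-Fa-f]{4}){3}-[0-9A-Fa-f]{12})$"
)


def parse_zoobank_lsid(lsid):
    """Return (namespace, uuid.UUID), or raise ValueError if malformed."""
    match = LSID_RE.match(lsid.strip())
    if match is None:
        raise ValueError(f"not a well-formed ZooBank LSID: {lsid!r}")
    namespace, identifier = match.groups()
    return namespace, uuid.UUID(identifier)


# The five LSIDs given in the Methods, one per new species.
NEW_TAXA = {
    "Avioxyela gallica":
        "urn:lsid:zoobank.org:act:A9759D8D-785F-472B-A99A-9FE31622CCBC",
    "Stephanastus polinae":
        "urn:lsid:zoobank.org:act:A75F53A2-0BE2-425D-97B4-33A572F20D73",
    "Metabolarva bella":
        "urn:lsid:zoobank.org:act:7D1BE021-C4C3-4760-B8C7-6099454C2A8F",
    "Aviorrhyncha magnifica":
        "urn:lsid:zoobank.org:act:41D4931A-57A9-42FD-8F01-7F909906782E",
    "Westphalopsocus pumilio":
        "urn:lsid:zoobank.org:act:68C4E20D-1472-49FD-9826-24CE88A0CACE",
}

parsed = {name: parse_zoobank_lsid(lsid) for name, lsid in NEW_TAXA.items()}
```

A string whose scheme is mistyped, for example with a capital I in place of the lowercase l of lsid, is rejected with a ValueError rather than silently accepted.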
Extended Data Figure 1 | Representative of the stem hymenopterids, 
Avioxyela gallica gen. et sp. nov. a, Photograph of counterpart. 
b, Reconstruction of forewing based on part and counterpart. CuA, cubitus 
anterior; CuP, cubitus posterior; MA, median anterior; MP, median posterior; 
R, radius; RA, radius anterior; RP, radius posterior; ScP, subcosta posterior. 
Scale bar, 1 mm (a, b). 

Extended Data Figure 2 | Representative of the stem hymenopterids, 
Avioxyela gallica gen. et sp. nov., electron scanning photograph. a, Two 
wings partly overlapping. b, Reconstruction of the two forewings. c, Mirror of 
the second wing to show the identity of pattern with the first wing. d, e, Electron 
scanning photograph of radial vein of second wing with interpretation of the 
veins. Scale bar, 1 mm (a, b, c). 

Extended Data Figure 3 | Representative of the stem coleopterids, Stephanastus polinae gen. et sp. nov. Electron scanning photographs. a, Foreleg and 
pronotum. b, Mid-leg. Scale bar, 1 mm (a, b). 

Extended Data Figure 4 | Holometabolous larva, Metabolarva bella gen. et sp. nov. Photograph of posterior half of body. Scale bar, 1 mm. 

Extended Data Figure 5 | Representative of stem Euhemiptera, 
Aviorrhyncha magnifica gen. et sp. nov. a, Photograph of part. b, Electron 
scanning photograph of counterpart. AA1 + 2, first anal anterior vein; 
AA3 + 4, second anal anterior vein; PC, precosta. Scale bar, 1 mm (a, b). 

Extended Data Figure 6 | Euhemiptera wing venation. a, Wing base of Aviorrhyncha magnifica gen. et sp. nov. b, Wing base of a modern Fulgoroidea. Scale bar, 
1 mm (a, b). 

Extended Data Figure 7 | Representative of stem Psocodea, Westphalopsocus pumilio gen. et sp. nov. a, Reconstruction of forewing. b, Photograph. A, anal 
veins. Scale bar, 1 mm (a, b). 

Extended Data Figure 8 | Very small forewing of a dictyopteran from Avion. Specimen Avion 13. Scale bar, 1 mm. 
Feature detection and orientation tuning in the 
Drosophila central complex 


Johannes D. Seelig¹ & Vivek Jayaraman¹ 


Many animals, including insects, are known to use visual land- 
marks to orient in their environment. In Drosophila melanogaster, 
behavioural genetics studies have identified a higher brain struc- 
ture called the central complex as being required for the fly’s innate 
responses to vertical visual features’ and its short- and long-term 
memory for visual patterns” *. But whether and how neurons of the 
fly central complex represent visual features are unknown. Here we 
use two-photon calcium imaging in head-fixed walking and flying 
flies to probe visuomotor responses of ring neurons—a class of 
central complex neurons that have been implicated in landmark- 
driven spatial memory in walking flies** and memory for visual 
patterns in tethered flying flies*. We show that dendrites of ring 
neurons are visually responsive and arranged retinotopically. Ring 
neuron receptive fields comprise both excitatory and inhibitory 
subfields, resembling those of simple cells in the mammalian prim- 
ary visual cortex. Ring neurons show strong and, in some cases, 
direction-selective orientation tuning, with a notable preference 
for vertically oriented features similar to those that evoke innate 
responses in flies’?. Visual responses were diminished during 
flight, but, in contrast with the hypothesized role of the central 
complex in the control of locomotion’, not modulated during 
walking. Taken together, these results indicate that ring neurons 
represent behaviourally relevant visual features in the fly’s envir- 
onment, enabling downstream central complex circuits to produce 
appropriate motor commands’. More broadly, this study opens the 
door to mechanistic investigations of circuit computations under- 
lying visually guided action selection in the Drosophila central 
complex. 

Flies display a variety of visual pattern- and position-dependent 
behaviours, including stripe fixation’, short-term orientation memory’, 
pattern learning’ and place learning’. Common to these behaviours is a 
need to detect and respond to specific features in the insect’s visual 
surroundings. In addition, all these behaviours require the central 
complex’, a deep brain region that has also been implicated in motor 
control®. We used two-photon calcium imaging in genetically targeted 
populations of central complex input neurons in behaving flies to 
investigate their potential visuomotor role. We focused on the dendritic 
responses of ring neurons—neurons that connect the lateral triangle 
(LTR) to the ellipsoid body’ (Fig. 1a) and have been specifically 
implicated in visuomotor memory”. 

Electron microscopy in the locust has shown that dendrites of ring 
neuron analogues arborize in specialized structures in the LTR called 
microglomeruli, where they are contacted by axonal projections from 
visual areas’®. Confocal images of the Drosophila LTR labelled with 
green fluorescent protein (GFP) under the control of a pan-neuronal 
driver line, R57C10 (ref. 11), revealed a similar dense microglomerular 
substructure in the region (Fig. 1b, Supplementary Videos 1-4). 

We asked if LTR microglomeruli respond to visual input. We used 
two-photon imaging with the calcium indicator GCaMP expressed 
pan-neuronally to record neural activity in the LTR of head-fixed 
Drosophila placed at the centre of a visual arena (Fig. 1c, d). Flies 
were presented with small bright vertical bars moving horizontally at 
different elevations and LTR calcium transients were recorded from 
several planes of focus on one or both sides of the brain in a single 
experiment (see Methods). Calcium transients showed strong temporal 
correlations at the spatial scale expected of LTR microglomeruli 
(Fig. 1e, Supplementary Video 5 and Extended Data Fig. 1a). Visual 
stimuli evoked robust calcium transients in a subset of microglomer- 
uli, but only when the localized stimuli were in specific spatial loca- 
tions around the fly (Fig. 1f). We computed receptive fields for 
responsive microglomeruli and found that most receptive fields are 
centred in the ipsilateral visual hemifield (Fig. 1g-i, Extended Data 
Fig. 1b and Methods). Finally, LTR microglomeruli are clustered 
retinotopically and principal component analysis based on receptive 
field centres indicates that they form a spatial map with axes that are 
almost parallel to the fly’s visual field (Fig. 1j, k and Extended Data 
Fig. 1c-e). 
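
One simple way to quantify a retinotopic arrangement of this kind is to correlate the anatomical position of each microglomerulus with the centre of its receptive field, and to compare the result with randomly re-arranged positions. The Python below is a hedged sketch of that idea only; it is not the analysis pipeline of the paper (which is described in its Methods), and the positions and azimuths are hypothetical values chosen for illustration.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences of length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def shuffle_p_value(xs, ys, n_shuffles=500, seed=0):
    """Fraction of random re-arrangements with r at least the observed r."""
    rng = random.Random(seed)
    observed = pearson_r(xs, ys)
    shuffled = list(ys)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if pearson_r(xs, shuffled) >= observed:
            hits += 1
    return hits / n_shuffles


# Hypothetical data for one focal plane: position along one anatomical
# axis (micrometres) and receptive field centre in azimuth (degrees).
position_um = [0, 2, 4, 6, 8, 10, 12, 14]
azimuth_deg = [-60, -45, -20, -10, 5, 30, 40, 65]

r = pearson_r(position_um, azimuth_deg)
p = shuffle_p_value(position_um, azimuth_deg)
```

In this toy example the arrangement is almost perfectly ordered, so r is close to 1; the correlations reported for real focal planes are considerably weaker.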

We next examined the anatomical relationship between the LTR 
and individual classes of ring neurons that send arborizations to the 
region®*”’. We studied dendritic arborization patterns of ring neurons 
targeted by EB1-GAL4, which labels R2 ring neurons required for 
pattern memory’, and c232-GAL4, which labels R3/R4d neurons 
required for spatial memory~ (Supplementary Videos 1 and 2 for 
R3/R4d; Supplementary Videos 3 and 4 for R2). In agreement with 
past anatomical work’, different ring neuron classes arborize in spe- 
cific contiguous parts of the LTR (Extended Data Fig. 2a, b). Each ring 
neuron in these classes extends dendrites into a single microglomer- 
ulus in the LTR, and sends axonal arbors throughout a class-specific 
ring of the ellipsoid body (Fig. 2a, Extended Data Fig. 2c-e, see also”"”). 

To understand whether different types of ring neurons have dis- 
tinctive visual response properties, we mapped receptive fields for 
dendritic microglomeruli of R3/R4d and R2 ring neurons (Extended 
Data Fig. 3a–c and Extended Data Fig. 3d-f, respectively). We found 
visual responses in ~7/40 c232-GAL4-labelled microglomeruli, 
corresponding to ~7/20 R4d microglomeruli (Extended Data Fig. 
3a), and ~14/20 of R2 microglomeruli labelled by EB1-GAL4. 
Receptive fields for R2 and R4d neurons cover large parts of the visual 
field, with highest density near the midline of the ipsilateral visual field 
(Extended Data Fig. 3g, h). In summary, R4d and R2 microglomeruli 
appear to have similar visual response properties and overlapping 
receptive fields, but with peak sensitivity in different parts of the visual 
field (Extended Data Fig. 3i-k). 

We next probed the fine structure of microglomerular receptive 
fields using sparse white noise stimuli (Fig. 2b, see Methods). 
Reverse correlation of microglomerular responses revealed prominent 
inhibitory subfields in the receptive fields (Fig. 2c for R2; Fig. 3a for 
R4d). The spatial scale of the receptive field structure we observe is within 
the range for visual features that evoke strong innate responses in flies, 
and that are used for visual pattern learning in tethered flies*’. To test 
the validity of these white-noise-based receptive fields, we used them to 
predict responses to novel bar stimuli (Fig. 2d, see Methods). The 
predicted responses captured much of the temporal and spatial vari- 
ation in the data (Fig. 2e), with high correlations between estimated 
and actual responses (Extended Data Fig. 4a, b). 
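
The logic of this procedure can be illustrated with a toy model. The Python below is a sketch under stated assumptions, not the implementation used in the paper: the model neuron is purely linear and noiseless, the stimulus is a one-dimensional row of eight pixels, and the names reverse_correlate, predict and sparse_noise are hypothetical. The receptive field is estimated as the mean-subtracted, response-weighted average of the stimulus frames and is then used to predict responses to two new bar stimuli.

```python
import random


def reverse_correlate(stimuli, responses):
    """Mean-subtracted, response-weighted average of the stimulus frames."""
    n_frames = len(stimuli)
    n_pixels = len(stimuli[0])
    mean_response = sum(responses) / n_frames
    rf = [0.0] * n_pixels
    for frame, response in zip(stimuli, responses):
        weight = response - mean_response
        for pixel, value in enumerate(frame):
            rf[pixel] += weight * value
    return [value / n_frames for value in rf]


def predict(rf, frame):
    """Linear prediction: dot product of receptive field and stimulus."""
    return sum(w * v for w, v in zip(rf, frame))


def sparse_noise(n_frames, n_pixels, p_on=0.1, seed=0):
    """Frames in which each pixel is +1 or -1 with probability p_on each."""
    rng = random.Random(seed)
    frames = []
    for _ in range(n_frames):
        frame = []
        for _ in range(n_pixels):
            u = rng.random()
            frame.append(1.0 if u < p_on else (-1.0 if u < 2 * p_on else 0.0))
        frames.append(frame)
    return frames


# Toy receptive field: one excitatory pixel (index 2), one inhibitory (5).
true_rf = [0.0, 0.0, 1.0, 0.0, 0.0, -1.0, 0.0, 0.0]
stimuli = sparse_noise(2000, len(true_rf))
responses = [predict(true_rf, frame) for frame in stimuli]
estimated_rf = reverse_correlate(stimuli, responses)

# Novel test stimuli: a bright bar on the excitatory or inhibitory pixel.
bar_on_excitatory = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
bar_on_inhibitory = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0]
```

The estimate recovers the true receptive field only up to a scale factor set by the stimulus variance, which is why predicted and measured responses are usually compared by correlation rather than by absolute amplitude.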


¹Janelia Farm Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, Virginia 20147, USA. 
Figure 1 | Drosophila ellipsoid body ring neurons arborize in visually 
responsive LTR microglomeruli which show a retinotopic organization. 
a, Schematic of fly central brain showing antennal lobe (AL), mushroom bodies 
calyces (MB) and optic lobes along with sub-structures of the central complex: 
ellipsoid body (EB), fan-shaped body (FB), protocerebral bridge (PB) and 
noduli (NO). Inset shows EB ring neurons (R1-R4) and LTR. b, Frame of a 
confocal stack showing LTR microglomeruli labelled by pan-neuronal GFP 
expression. c, Projection of two-photon calcium imaging video of LTR with 
overlay of microglomeruli selected based on responses to moving visual stimuli. 
In individual planes of focus, 20 ± 5 microglomeruli were delineated; over 
multiple planes of focus, 30 ± 4 microglomeruli were delineated (n = 11 flies) 
(see Methods). d, Schematic of receptive field mapping set-up with fly 
positioned in centre of curved visual display. e, Sample frame from trial 


Noting that the receptive field structure of ring neuron inputs 
resembles that of simple cells in mammalian primary visual cortex’’, 
we next asked if these neurons share other response properties. Indeed, 
when we presented flies with a series of moving oriented bars, we 
found strong orientation tuning in microglomerular response patterns 
(Fig. 2f, g). As expected for receptive field structures with both excit- 
atory and inhibitory lobes, microglomeruli also showed orientation 
tuning when presented with bars of opposite contrast, that is, dark 
bars on a bright background (Extended Data Fig. 5a, b), as are often 
used in fly behavioural studies”. Examining orientation tuning across 
the population of microglomeruli, we observed a strong preference for 
vertically oriented bars (Fig. 2h and Extended Data Fig. 4h, i for R2; 
Extended Data Fig. 4c, k, l for R4d; Fig. 2k and Extended Data Fig. 4g, 
n, o for the pan-neuronal line). Receptive fields tuned to vertical 
orientations are distributed across the visual field (Fig. 2i, j for R2; 
Extended Data Fig. 4c, d for R4d; Extended Data Fig. 4f for the 
pan-neuronal line). A fraction of neurons also shows direction-selectivity 
(Fig. 2j and Extended Data Fig. 4j for R2; Extended Data Fig. 4e, m for 
R4d; Fig. 2l and Extended Data Fig. 4p for the pan-neuronal line). 
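
Orientation and direction tuning of this kind are often summarized by vector averaging of the responses over the tested directions. The Python below sketches that common definition; it is an assumption made for illustration and not necessarily the measure used in the paper (see its Methods). Doubling the angle before averaging collapses opposite directions onto one orientation in the range 0° to 180°. The function name tuning_summary and the response values are hypothetical.

```python
import cmath
import math


def tuning_summary(directions_deg, responses):
    """Vector-average summary of a tuning curve (non-negative responses)."""
    total = sum(responses)
    if total <= 0:
        raise ValueError("responses must be non-negative with a positive sum")
    pairs = list(zip(directions_deg, responses))
    # Direction vector: each response weights a unit vector at its angle.
    dir_vec = sum(r * cmath.exp(1j * math.radians(d)) for d, r in pairs) / total
    # Orientation vector: angles are doubled so that d and d + 180 coincide.
    ori_vec = sum(r * cmath.exp(2j * math.radians(d)) for d, r in pairs) / total
    return {
        "preferred_orientation_deg": (math.degrees(cmath.phase(ori_vec)) / 2) % 180,
        "orientation_selectivity": abs(ori_vec),
        "preferred_direction_deg": math.degrees(cmath.phase(dir_vec)) % 360,
        "direction_selectivity": abs(dir_vec),
    }


directions = [0, 45, 90, 135, 180, 225, 270, 315]

# Hypothetical unit responding equally to both directions of one orientation.
orientation_tuned = tuning_summary(directions, [1, 1, 5, 1, 1, 1, 5, 1])

# Hypothetical unit responding to only one of the two directions.
direction_tuned = tuning_summary(directions, [1, 1, 5, 1, 1, 1, 1, 1])
```

With the angle convention of Fig. 2f, in which 90° and 270° are the two directions of movement of a vertical bar, the first unit is orientation selective without direction selectivity, whereas the second is selective for both.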

We next assessed the degree of stereotypy in response properties of 
ring neuron dendrites of different flies. We found strong correlations 
across flies in receptive field structure for R4d (Fig. 3a, b, Extended 
Data Figs 6 and 8), and R2 (Fig. 3c, Extended Data Figs 7 and 8). 

Ring neurons and the ellipsoid body have often been ascribed a role 
in complex visuomotor tasks**. We examined the possible motor func- 
tion of ring neurons by assessing potential correlations between neural 
activity and locomotion in tethered flies walking on an air-supported 
showing responses in selected glomeruli (red outlines). f, Calcium transients of 
three LTR microglomeruli in response to visual stimulus moving left to right in 
front of the fly at different elevations. g, Two-dimensional response maps 
(two-trial averages) for all microglomeruli shown in c. h, i, LTR microglomeruli from 
left and right hemisphere of same fly, coloured according to the centre of the 
receptive field in azimuth (h) and elevation (i). j, Histogram of correlation 
coefficients between receptive field centre and anatomical position. For n = 42 
focal planes with 20 ± 5 glomeruli, correlation is significantly different than for 
randomly arranged microglomeruli (r = 0.27 ± 0.17, P= 2.3X10°'7, n = 11 
flies), indicating retinotopy in the organization of microglomeruli across flies. 
k, Histogram of primary retinotopic axis of LTR map as found by principal 
component analysis (see Methods, n = 42 focal planes, 11 flies). All scale bars, 
5 µm. 


ball’* or flying’, in darkness or in the presence of a bright stripe 
moving left or right in front of the fly (Fig. 4a, see also Methods and 
Supplementary Videos 6 and 7). Although some R3/R4d neurons did 
show occasional correlations with locomotion when flies walked in the 
dark, and responses from visually stimulated animals showed occa- 
sional modulation during walking, the changes were within the 
expected variability of visual responses in stationary flies (Fig. 4b-d). 
Responses were also insensitive to walking direction. Overall, LTR 
visual responses could be modelled accurately without taking walking 
state into account (Extended Data Fig. 9a-e and Extended Data 
Fig. 10). By contrast, responses to visual stimuli were consistently 
diminished during tethered flight (Fig. 4e, f, Extended Data Fig. 9f-h 
and Supplementary Video 7), but showed no obvious correlations with 
flight direction as assessed by differences in wingbeat amplitude envel- 
ope. Thus, ring neuron LTR responses were modulated by motor state, 
but not in a manner consistent with a direct role in motor coordination, 
and in a markedly different manner than in the optic lobe’*"®. 

Behavioural genetics studies in Drosophila have suggested that the 
central complex is required for a wide range of important sensorimo- 
tor functions. However, in the absence of physiological recordings 
from the region in flies, it has been challenging to determine its role 
in the diversity of behaviours that it has been implicated in. We studied 
the visuomotor responses of ring neurons, which provide input from 
the LTR to the ellipsoid body of the central complex. Analogous neu- 
rons in other insects respond to polarized and unpolarized light’, 
and to mechanosensory stimulation’’, one of the sensory modalities 
that we did not explore but which may partly account for unresponsive 
Figure 2 | Ring neurons are tuned to specific visual features and 
orientations. a, Multicoloured FLP-out of R2 neurons showing three cell 
bodies on each side along with their colour-matched microglomeruli (green, 
light green and purple at left; two green and one red at right). Red and yellow 
asterisks mark two cell bodies (one on each side, lateral) and arrows of like 
colour their respective LTR microglomeruli (medial). All neurons send 
processes throughout ellipsoid body rings. Scale bar, 30 µm. b, Example frame 
of white noise stimulus used for receptive field mapping using reverse 
correlation. c, Sample receptive fields of R2 microglomeruli. Red subfields show 
excitatory responses >30% of maximum; blue subfields show inhibitory 
responses <30% of minimum of mean-subtracted weighted average. See 
Extended Data Fig. 8 for all receptive fields. d, Bright bars with four different 
orientations used as test features. e, Modelled and actual (black) ΔF/F changes 
of an R2 microglomerulus in response to differently oriented bars (fly 2 in 
Extended Data Fig. 7). In red is shown the trial used for fitting parameters (i), 
and the tests (ii-iv). f, Orientation tuning curves for R2 neurons (two-trial 
average, fit in red). The measurement 90° corresponds to back-to-front 
movement of the vertical bar, 270° to front-to-back movement of the vertical 
bar. Error bars represent standard deviation. g, Polar plots of orientation tuning 
data and fits (red) for data shown in f. Numbers italicized in bold indicate ΔF/F 
(%). h, Microglomeruli of R2 neurons coloured by orientation preference 
(collapsed to 0°-180°) and orientation selectivity (two-trial average). i, Same 
microglomeruli as in h coloured by azimuth and elevation of centre of their 
excitatory receptive fields, measured using horizontally moving bars as 
described in Extended Data Fig. 1 (two-trial average). j, Direction selectivity of 
same microglomeruli (two-trial average). k, l, Preferred orientation (collapsed 
to 0°-180°) (k) and direction selectivity (l) of microglomeruli in pan-neuronal 
line (both four-trial average). See Methods for analysis details. Scale bars for 
h and k, 5 µm. 


neurons in our study. We found that R2 and R4d ring neurons are 
visually responsive, and these responses are not significantly modulated 
by walking state. Although visual responses are diminished during 
Figure 3 | Ring neuron LTR microglomeruli show stereotyped receptive 
field properties across flies. a, Subset of receptive fields measured in R4d 
neurons across seven flies aligned by similarity (see Extended Data Fig. 8b for 
full set). Numbers below receptive fields are cross-correlations with top 
receptive field in the column as template. b, Histogram of cross-correlation 
values calculated for R4d neuron receptive fields with best-matched template. 
c, Histogram of cross-correlation values for R2 neurons (n = 6 flies, see 
Extended Data Fig. 8a for receptive fields). 


flight, they do not vary systematically with wingbeat patterns associ- 
ated with turns. It is possible that outputs of these ring neurons within 
the ellipsoid body rings could be more sensitive to motor actions, but 
our physiological results in their dendrites during behaviour are incon- 
sistent with a major role for these ring neurons in motor coordination 
in the fly. Further, the high degree of stereotypy that we observe in ring 
neuron receptive field characteristics across flies indicates that, rather 
than directly performing motor coordination, these neurons probably 
provide downstream central complex circuits with similar behaviou- 
rally relevant visual feature sets on which to base motor decisions. As a 
striking example, the strong vertical tuning preference we observe in 
the LTR may partly underlie the tendency of flies to fixate on vertical 
edges during both flight and walking’**'. Recent work has suggested 
that vertical stripe fixation during walking largely relies on a hypothe- 
sized position system sensitive to local luminance changes that can 
operate independently of neural circuits involved in optomotor responses 
to wide-field motion*’”. The response properties of R2 and 
R4d neurons are consistent with a role in such a position system”. 
Figure 4 | Ring neuron visual responses are not significantly modulated 
during walking but diminished during flight. a, Set-up for two-photon 
imaging in behaving flies. Insets show schematic of fly tethered in flying holder 
and positioned on air-supported ball in walking holder. b, Subset of 
simultaneously recorded R2 neurons during walking. Boxes indicated with an 
asterisk show responses to identical visual stimuli when fly is stationary versus 
walking. Azimuthal position of visual stimulus shown in e (bottom). 
c, Distributions of R4d neuron visual responses during walking and 
non-walking conditions are not significantly different (n = 14 flies; 
trials_walking = 1,722; trials_walking<50% = 42; mean_walking = 0 ± 44.3%; 
mean_walking<50% = 1.1 ± 49.7%; P = 0.45). Percentage decrease in ΔF/F during 
walking was calculated as $(\Delta F/F - (\Delta F/F)_{\mathrm{walk}})/(\Delta F/F)_{\mathrm{walk}}$. d, Same as c for R2 
neurons (n = 8 flies; trials_walking = 2,015; trials_walking<50% = 245; 
mean_walking = 0 ± 28.1%; mean_walking<50% = −2.2 ± 24.4%; P = 0.37). 

e, Subset of simultaneously recorded R4d microglomeruli during flight. 
Boxes indicated with an asterisk show diminished responses to identical visual 
stimuli during flight. f, Distributions of all R4d microglomeruli recorded 
show a significant shift towards lower responses during flight 
(n = 13 flies; trials_flying = 759; trials_flying<50% = 481; mean_flying = 0 ± 42.2%; 
mean_flying<50% = 31.2 ± 72.3%; P= 6 X 10 *°). Percentage decrease in ΔF/F 
during flying was calculated as $(\Delta F/F - (\Delta F/F)_{\mathrm{fly}})/(\Delta F/F)_{\mathrm{fly}}$. All P-values show 
two-sample Kolmogorov–Smirnov test. 
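
The normalization and the comparison of distributions described in this legend can be sketched as follows. The Python below is illustrative only: it assumes that the reference term in the formula is the mean ΔF/F over trials in the moving state, it computes only the two-sample Kolmogorov–Smirnov statistic (the largest distance between the two empirical cumulative distributions) and not the P-value, and all response values are hypothetical.

```python
def percent_change_from_mean(values, reference):
    """100 * (x - mean(reference)) / mean(reference) for each x in values."""
    reference_mean = sum(reference) / len(reference)
    return [100.0 * (v - reference_mean) / reference_mean for v in values]


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic D (no P-value)."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Step both empirical CDFs past every observation equal to x.
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


# Hypothetical peak responses (dF/F) of one microglomerulus.
walking = [0.8, 1.0, 1.2]
stationary = [1.0, 1.1]

walking_change = percent_change_from_mean(walking, walking)
stationary_change = percent_change_from_mean(stationary, walking)
d = ks_statistic(walking_change, stationary_change)
```

By construction the walking trials are centred on 0%, matching the zero means listed in the legend for the moving state. A P-value for D can be obtained from a statistics library, for example scipy.stats.ks_2samp.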


The retinotopic bias, structure of excitatory and inhibitory subfields, 
orientation tuning and direction selectivity we see are reminiscent of 
those seen in calcium imaging studies in simple cells in mammalian 
visual cortex’’, providing an interesting example of how evolutionarily 
distant visual systems with different types of eyes nonetheless use 
similar feature sets to process visual scenes. From Hubel and Wiesel’s 
findings several decades ago”’ to the present, significant progress has 
been made on identifying neural representations used at different 
stages in the mammalian visual system. However, mechanisms under- 
lying simple cell responses are not yet fully understood”. With its array 
of genetic tools, Drosophila melanogaster may allow us to uncover how 
spatiotemporal interactions of excitatory and inhibitory inputs might 
produce similar orientation tuning and direction selectivity in the 
LTR*”*. There is considerable evidence for spatially tuned visual res- 
ponses in the lobula complex and anterior optic tubercle in other 
insects” °°, suggesting that components of ring neuron input response 
properties may also arise from selective averaging of weaker and more 
broadly distributed tuning preferences in such areas. 

Overall, our findings lay the groundwork for future research into 
how this genetic model organism’s small brain uses feature and pattern 
information for visual orientation and navigation. 


METHODS SUMMARY 


Calcium imaging experiments to measure receptive fields were performed with 
UAS-GCaMP3;c232-GAL4, UAS-GCaMP3;EB1-GAL4, and pJFRC7-20XUAS- 
GCaMP5.003 (VK00005)/R57C10-GAL4 flies. We used pGP-JFRC7-20XUAS- 
IVS-GCaMP6f 15.693(attP40)/R57C10-GAL4 flies for pan-neuronal orientation 
tuning experiments and pGP-JFRC7-20XUAS-IVS-GCaMP6s 15.641 (attP40)/ 
c232-GAL4 for orientation tuning experiments with dark bars. c232-GAL4 and 
EB1-GAL4 were gifts from M. Heisenberg and T. Lee, respectively. 

For calcium imaging experiments, female flies were mounted in custom stain- 
less steel shim holders with designs similar to those previously described’*”*. The 
dissection procedure, LED visual arena and two-photon imaging set-up were 
similar to those used in previous experiments". All data analysis was performed 
in MATLAB (MathWorks). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 


Fly stocks. All experiments were performed with female flies, with ages chosen 
based on expression levels of relevant fluorescent proteins. At least six animals 
were used for any single condition tested—specific sample sizes are noted for each 
set of experiments and were chosen based on the level of variability observed in 
initial experiments. Flies were randomly picked from their housing vials for all 
experiments. We used all data that passed a quality threshold based on the 
observed health of the fly during an experiment and the signal-to-noise ratio of 
the imaging signal. 

Calcium imaging experiments to measure receptive fields were performed with 
UAS-GCaMP3;c232-GAL4, UAS-GCaMP3;EB1-GAL4, and pJFRC7-20XUAS- 
GCaMP5.003 (VK00005)/R57C10-GAL4 flies. We used pGP-JFRC7-20XUAS- 
IVS-GCaMP6f 15.693(attP40)/R57C10-GAL4 flies for pan-neuronal orientation 
tuning experiments (2-5-days old) and pGP-JFRC7-20XUAS-IVS-GCaMP6s 
15.641 (attP40)/c232-GAL4 for orientation tuning experiments (1-2-weeks old) 
with dark bars. c232-GAL4 and EB1-GAL4 were gifts from M. Heisenberg and T. 
Lee, respectively. 

To label dendrites and axons in different ring neurons, EB1-GAL4 and c232-GAL4 
were each crossed to pJFRC67-3XUAS-IVS-Syt-GFP(attP 18)’, pJFRC118-10XUAS- 
DenMark(attP40)* and pJFRC119-10XUAS-IVS-myr::TopHat2(VK00005) (gift of 
B. Pfieffer and R. Harris, unpublished). 

To compare the expression patterns of EB1-GAL4 and c232-GAL4 to the pan- 
neuronal line R57C10 in the same fly, EB1-GAL4 and c232-GAL4 were each 
crossed to pJFRC19-13XLexAop2-IVS-myr::GFP (su(Hw)attP8)”, pJFRC21- 
10XUAS-IVS-mCD8::RFP (attP18)**; Sco/Cyo; R57C10-LexA::p65 (attP2)™*. 

For stochastic single cell labelling of EB1-GAL4 and c232-GAL4 with three 
colours a ‘flip-out’-based approach** (Nern et al., manuscript in preparation) 
was used. In brief, heat-shock induced expression of FLP recombinase was used 
to excise FRT-flanked interruption cassettes from UAS reporter constructs car- 
rying HA, V5 and Flag epitope tags, and stained with epitope-tag specific anti- 
bodies. This labelled a subset of the cells in the expression pattern with a stochastic 
combination of the three labels. 

Anatomy: fly dissections, immunohistochemistry and confocal imaging. 
Confocal stacks were recorded with a ×40 or ×63 objective on a Zeiss confocal 
microscope. Dissections and staining were performed as previously described***’. 
The primary antibody mixture consisted of 1:1,000 sheep anti-GFP (AbD Serotec), 
1:1,000 rabbit anti-DsRed (Clontech), 1:100 rat anti-HA (for Syt/DenMark/HA 
staining, Roche), and 10% normal donkey serum (Jackson ImmunoResearch) in 
PBS-TX. 

Fly preparation for receptive field mapping. Receptive field measurements were 
performed using a preparation described previously’, but with the behavioural 
apparatus removed to maximize the fly’s visual field. The fly was briefly anaes- 
thetized on ice and transferred to a cold plate held at 4 °C. The proboscis of the fly 
was fixed by either pressing it into the head and fixing it with wax, or by stretching 
it with a pair of tweezers mounted on a micromanipulator and then fixing it with a 
mixture of wax and colophony. We additionally removed either the front legs or all 
legs for receptive field measurements. The fly was glued to a pin and positioned in 
the holder using a micromanipulator and fixed in the holder with UV gel'*. An 
opening was cut into the head to obtain optical access to the brain. To stop brain 
movement due to pulsation of muscle M16, we cut the muscle or the nerves 
innervating the muscles with dissection needles. The fly holder (including the 
micromanipulator) was then transferred to the microscope and mounted using 
magnetic mounts. Flies were dark adapted for 5 to 10 min before recordings 
started. 

Fly preparation for imaging during walking. For walking experiments, an air- 
supported ball was positioned under the fly with a three-axis micromanipulator as 
described previously’* and the walking velocity of the fly was monitored using a 
camera system. 

Fly preparation for imaging during tethered flight. For flying fly experiments, a 
holder similar to those described previously'*!*** was used. The holder was made 
out of two pieces of stainless steel shim (thickness 12.7 µm). The shim is cut using a 
laser mill and then folded into its pyramidal shape. One piece of the holder is glued 
onto a liquid chamber similar to the one used for walking behaviour using epoxy". 
After removing the front legs of the fly to prevent it from grabbing the holder, the 
fly is glued to a pin and inserted into the holder with a micromanipulator. The 
second piece of the holder is then inserted to close the pyramidal shape around the 
fly’s head. 

The setup for combined imaging and flight behaviour was similar to one prev- 
iously described’*. The fly was illuminated with infrared light from below. The 
wing beat of the fly was recorded using a mirror placed beneath the fly and a 
camera (Basler 602f, operating at 100 or 150 Hz frame rate)". 

Two-photon imaging. Calcium imaging was performed using a custom-built two-photon 
microscope controlled with ScanImage”’. Fluorescence was detected using 
a photomultiplier tube (H7422PA-40, Hamamatsu). We used an Olympus ×40 
objective (LUMPlanFI/IR, numerical aperture 0.8) and typically adjusted the 
power to below 20 mW at the back aperture of the objective. We imaged at a frame 
rate of 6.7 Hz. Focal planes were selected based on the anatomy and visual res- 
ponses of microglomeruli. We focused on microglomeruli because this approach 
allowed us to distinguish different neurons in a labelled population (such distinc- 
tions are difficult to make in the ellipsoid body, where axons of different neurons 
arborize in close proximity). 

Visual stimulation: LED arena. Visual stimuli were presented using a curved 
visual display” that was covered with a colour filter to prevent cross-talk between 
fluorescence detection and visual stimulation as described". Additionally, to avoid 
reflections of stimuli from the curved surface of the display, we covered the display 
with a diffuser (tracing paper). Under such low-contrast conditions, we found that 
the signal-to-noise ratio (SNR) of calcium responses when stimulated by dark on 
bright-background stimuli (for example, dark bar on bright surround) was low. 
This motivated our preference for bright on dark-background stimuli, which 
produced higher SNR calcium responses (comparison in Extended Data Fig. 5). 

For receptive field measurements the display spanned 270° in azimuth and 120° 
in elevation. The top left and top right corners of the display (three square panels in 
each corner, each with a size of 30° by 30°) could not be seen by the fly (because 
they were occluded by the fly holder) and were excluded from the display. For 
behavioural experiments, a similar but smaller display was used, spanning 210° in 
azimuth and 90° in elevation. 

Visual stimulation: flashing dots for receptive field mapping. Excitatory 
receptive fields for c232-GAL4 and EB1-GAL4 flies were measured with stationary 
bright square dots (7.5° by 7.5°) appearing randomly in the visual field of the fly for 
1s followed by a dark period of 1s. The measurements were repeated until the 
entire visual field covered by the display was stimulated once. The display was 
sampled with a spatial resolution of the stimulus (7.5°). Measurements were 
performed in 8 blocks of 140s each, presenting a total of 468 stimuli covering 
the entire display. 

Visual stimulation: horizontally moving bars for receptive field mapping. We 
used bright bars (15° in elevation, 7.5° in azimuth) that moved at a speed of 30° s⁻¹ 
left and right (in azimuthal direction) in the fly’s field of view. 

Two repetitions with stimulation in one direction were followed by two repetitions 
of movement in the opposite direction. To map responses over the entire 
visual field spanned by the display, the stimulus was shifted in steps of 15° in 
elevation from the top-most part of the visual display towards the bottom part and 
the left and right moving stimulation was repeated in each row. 

Visual stimulation: white noise. For white-noise stimulation*”*! we subdivided 
the display into squares of 11.25° by 11.25°. Each stimulus frame consisted of 20 
randomly selected bright squares whilst the remaining squares remained dark (in a 
few trials for one fly we used 30 squares). The stimulus appeared for one second 
followed by a dark period of one second. We presented 60 frames of random 
stimuli in a trial of 140 s. For c232-GAL4 flies we used an average of 33 ± 20 trials 
in 7 flies (59, 61, 29, 21, 21, 14 and 26 trials, respectively). For EB1-GAL4 flies we 
used an average of 41 ± 13 trials in 6 flies (38, 25, 39, 63, 35 and 47 trials, 
respectively). The colour scale (Fig. 2, Extended Data Fig. 8) was adjusted to give 
equal weight to excitatory and inhibitory subfields (mean set to 0). 

To validate white-noise-based responses, we stimulated the fly with bright bars 
of 56.25 degrees by 18.75 degrees oriented either vertically, horizontally or rotated 
by 45 or −45 degrees with respect to the vertical direction. To stimulate the 
receptive fields of all microglomeruli in an unbiased way, the bar stimuli were 
presented at a random position on the display and the display was sampled with a 
resolution of 11.25° in both elevation and azimuth. 

Visual stimulation for measuring orientation tuning curves. For orientation 
tuning curve measurement we used a stimulus that extended across the entire 
display, from 120° in azimuth for vertical orientation (90°) to 270° azimuth for 
horizontal orientation. The width of the bar was 15° and it was moved at a speed of 
75° s⁻¹. The angle was changed incrementally in steps of 11.25° starting with 
either horizontally or vertically oriented bars for one direction of movement, 
and then repeated in the opposite direction of movement. To obtain tuning curves 
with dark stimuli we inverted the contrast of bright and dark and removed the 
diffuser. 

Visual stimulation during walking and flying experiments. For walking and 
flying experiments we used a vertical bright bar spanning 90° in elevation and 15° 
in azimuth. The bar moved horizontally at a velocity of 15° s⁻¹. The bar stayed 
stationary for 10s after moving in one direction for 17 s and then resumed moving 
in the opposite direction for 17s. 

Data analysis. All data analysis was performed in MATLAB (MathWorks). All 
errors and error bars shown are s.d. All P-values shown are based on t-tests, unless 
otherwise noted. 

Frame alignment and movement correction. Data recorded using two-photon 
imaging were aligned in the xy plane on a frame-by-frame basis. Data were thre- 
sholded to distinguish arborizations in the lateral triangle from background. All 
above-threshold pixels were set to the same value. Frames were aligned by cross- 
correlating each thresholded frame to a single frame at the beginning of the 
measurement. Multiple trials were aligned by cross-correlation of trial-averaged 
frames thresholded as above. 
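
The alignment step can be illustrated with a minimal Python sketch. It is not the MATLAB code used for the analysis: the function names, the exhaustive search over integer shifts and the `max_shift` bound are assumptions made for illustration, and frames are represented as small nested lists.

```python
def binarize(frame, threshold):
    """Set all above-threshold pixels to 1 and all other pixels to 0."""
    return [[1 if v > threshold else 0 for v in row] for row in frame]


def overlap(ref, img, dy, dx):
    """Cross-correlation of two binary frames with img shifted by (dy, dx)."""
    h, w = len(ref), len(ref[0])
    total = 0
    for y in range(h):
        for x in range(w):
            sy, sx = y - dy, x - dx  # pixel of img that lands on (y, x)
            if 0 <= sy < h and 0 <= sx < w:
                total += ref[y][x] * img[sy][sx]
    return total


def best_shift(ref, img, threshold, max_shift=3):
    """Return the (dy, dx) shift of img that best matches the reference frame.

    max_shift is a hypothetical search bound, not a value from the paper.
    """
    ref_b, img_b = binarize(ref, threshold), binarize(img, threshold)
    candidates = [(dy, dx)
                  for dy in range(-max_shift, max_shift + 1)
                  for dx in range(-max_shift, max_shift + 1)]
    return max(candidates, key=lambda s: overlap(ref_b, img_b, s[0], s[1]))
```

Binarizing before correlating means that the estimated shift depends on the shape of the labelled arborization rather than on stimulus-driven changes in brightness.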

Calculation of fluorescence changes. The baseline for calculating ΔF/F was 
selected by averaging over the 10% of frames with lowest intensity in each trial 
or by using the baseline at the beginning of the experiment. Due to the low baseline 
intensity of GCaMP6, background fluorescence was not subtracted in measure- 
ments with GCaMP6. Calcium traces recorded from behaving flies were smoothed 
with a third order Savitzky-Golay filter over 7 frames for comparisons with 
behavioural data. 
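
As a rough illustration of the baseline and smoothing steps, the following Python sketch computes ΔF/F from the dimmest 10% of frames and applies a 7-frame, third-order Savitzky-Golay filter. The function names are hypothetical, the handling of the first and last three frames is an assumption, and the original analysis was done in MATLAB.

```python
def delta_f_over_f(trace, fraction=0.10):
    """Return dF/F, using the mean of the dimmest `fraction` of frames as baseline."""
    n_base = max(1, int(round(fraction * len(trace))))
    baseline = sum(sorted(trace)[:n_base]) / n_base
    return [(f - baseline) / baseline for f in trace]


# Standard 7-point Savitzky-Golay smoothing weights for a cubic fit (they sum to 21).
SG7 = [-2, 3, 6, 7, 6, 3, -2]


def savgol7(trace):
    """Smooth interior samples with a 7-frame, third-order Savitzky-Golay filter.

    In this sketch the first and last three samples are left unchanged.
    """
    out = list(trace)
    for i in range(3, len(trace) - 3):
        window = trace[i - 3:i + 4]
        out[i] = sum(w * v for w, v in zip(SG7, window)) / 21.0
    return out
```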

Regions of interest selection. For receptive field measurements in stationary flies, 
regions of interest (ROIs) corresponding to microglomeruli were selected manually 
in videos of ΔF/F. Overlapping parts of ROIs were excluded from further 
analysis. 

For experiments in behaving flies, regions of interest were selected using visually 
supervised k-means clustering (Extended Data Fig. 10a-d). We used a subset of 
three trials (2,820 frames) for clustering-based selection of ROIs. We then used 
correlation-based k-means clustering between the calcium traces in all thresholded 
pixels. The number of clusters was selected based on an estimate of the number of 
microglomeruli. If not all microglomeruli could be separated into different clus- 
ters, the number of clusters was increased in a second run. We then set a threshold 
to remove clusters that were smaller than a certain number of pixels (60). We 
additionally removed clusters that had an average cross-correlation value lower 
than a threshold (0.2). We further split anatomically disconnected regions of the 
same cluster and again removed those parts that were smaller than a size threshold 
(30 pixels). In a final check, the remaining ROIs were overlaid with the frames of 
the calcium video that showed the largest response in this ROI, and ROIs that did 
not correspond to microglomeruli were removed manually. 

Receptive field mapping. Receptive fields (RFs) were smoothed using a Gaussian 
filter (4 pixels with a standard deviation of 1 pixel by default; 5 pixels with a 
standard deviation of 4 pixels for white noise measurements). 

For RF measurements (for example, in Fig. 1), we combined responses to left- 
and right-moving stimuli (each averaged over two trials). The onset of the calcium 
response was correlated with the bar’s movement into the RF. Due to the size of the 
RF, a bar moving from the right side of the fly towards the left side entered the RF 
from the right side and induced a calcium onset starting at the right side of the 
(excitatory part of the) RF. Similarly, a bar entering the receptive field from the left 
side induced a calcium onset starting at the left side of the RF. Due to the faster on- 
response than off-response of GCaMP, the onset of the RF was better defined than 
the offset. To find the centre of the excitatory RF measured with moving bars, we 
defined the RF centre as the weighted centroid of the average of responses to 
thresholded left- and right-moving stimuli (Extended Data Fig. 1b). This was 
equivalent to delimiting the RF by its calcium onset response. This procedure 
made the location of computed RF centres invariant to the kinetics of the calcium 
indicator. 
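
The centre estimate described above can be sketched in Python as follows. This is a simplified illustration with hypothetical names: the two response maps are nested lists on the same grid, and the threshold is supplied by the caller because its value is not restated here.

```python
def weighted_centroid(rf_left, rf_right, threshold):
    """Centroid (row, col) of the average of thresholded left/right response maps."""
    total = sum_r = sum_c = 0.0
    for r, (row_l, row_r) in enumerate(zip(rf_left, rf_right)):
        for c, (a, b) in enumerate(zip(row_l, row_r)):
            a = a if a >= threshold else 0.0  # threshold each map separately
            b = b if b >= threshold else 0.0
            w = 0.5 * (a + b)                 # average of the two directions
            total += w
            sum_r += w * r
            sum_c += w * c
    if total == 0.0:
        raise ValueError("no pixel exceeds the threshold")
    return sum_r / total, sum_c / total
```

Because slow indicator decay smears the response in the direction of bar movement, the two maps are displaced in opposite directions, and averaging them cancels this displacement to first order.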

We used the following parameters to characterize the excitatory parts of RFs 
measured with single stationary dots (measured at 50% of the maximum ΔF/F 
response of each RF) (see Extended Data Fig. 7c): area, major axis (of an ellipse that 
has the same normalized second central moments as the region as determined with 
the MATLAB ‘regionprops’ function), minor axis, eccentricity, orientation (the 
angle between the horizontal x axis and the major axis of the ellipse, 0 corres- 
ponding to horizontal orientation), retinotopic correlation (the correlation coef- 
ficient between the centre of the RF, determined as the weighted centroid of the RF 
area and the centre of the corresponding ROI). 

Receptive field display. To display microglomerular RFs, we coloured light-emitting 
diode (LED) arena positions in proportion to the ΔF/F response elicited 
by the stimulus presented in that position. For moving stimuli, the calcium response 
in each pixel was determined as the average over all response frames that 
were recorded while the stimulus was at that position. Calcium responses were 
interpolated to account for mismatches between frame rates and movement of 
stimuli. For stimulation with stationary stimuli, the ΔF/F values shown in RF plots 
correspond to the maximum ΔF/F during the stimulation period. To prevent 
cross-talk between sequentially presented stimulus frames due to the slow decay 
of the calcium response (which extended beyond stimulus presentation) we only 
considered the calcium increase and not the calcium decay in assigning ΔF/F 
values to stimulus frames. 

Retinotopy. To assess retinotopy, we calculated the correlation coefficient 
between microglomerulus centres and centres of their RFs. This was compared 
to the correlation coefficient obtained after randomly shuffling correspondences. 
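
A shuffle control of this kind can be sketched in Python as below. It is a one-dimensional simplification with hypothetical function names: one coordinate of each microglomerulus centre is correlated with one coordinate of its RF centre. The default of 3,000 permutations follows the number quoted in Extended Data Fig. 1, and the fixed random seed is an assumption added for reproducibility.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def shuffle_control(glom_centres, rf_centres, n_perm=3000, seed=0):
    """Return the observed correlation and the mean correlation after shuffling."""
    rng = random.Random(seed)
    observed = pearson(glom_centres, rf_centres)
    shuffled = list(rf_centres)
    null = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break the microglomerulus-to-RF correspondence
        null.append(pearson(glom_centres, shuffled))
    return observed, sum(null) / n_perm
```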


Additionally, we performed principal component analysis (PCA) on the (x, y) 
values of RF centres. The first PC gave us the direction of maximum variation of 
RF centres, which we consider to be the primary retinotopic axis (in the fly’s visual 
field) for the RF population. 

Orientation tuning. To measure orientation tuning (Fig. 2), single 15°-wide bars 
spanning the entire visual display were presented, and their orientation changed in 
steps of 11.25°. Responses from multiple trials were then averaged, and tuning 
curves were fit using the sum of two circular Gaussians: 

$$a_1 e^{k_1(\cos(\theta-\theta_1)-1)} + a_2 e^{k_2(\cos(\theta-\theta_2)-1)}$$

in which $a_1$ and $a_2$ are amplitudes, $\theta_1$ and $\theta_2$ are the maximum angles and $k_1$ and $k_2$ 
are width parameters. 
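
To make the tuning-curve model concrete, here is a minimal Python sketch that evaluates a sum of two circular Gaussians and reads off the preferred angle as the maximum on a fine grid. The function names, the 0–360° evaluation range and the 0.25° grid spacing are assumptions for illustration; the fitting procedure itself is not shown.

```python
import math


def tuning_curve(theta_deg, a1, theta1_deg, k1, a2, theta2_deg, k2):
    """Sum of two circular Gaussians evaluated at angle theta (degrees)."""
    th = math.radians(theta_deg)
    g1 = a1 * math.exp(k1 * (math.cos(th - math.radians(theta1_deg)) - 1.0))
    g2 = a2 * math.exp(k2 * (math.cos(th - math.radians(theta2_deg)) - 1.0))
    return g1 + g2


def preferred_angle(params, grid_step_deg=0.25):
    """Angle (degrees) at which the fitted curve is maximal on a regular grid."""
    n = int(round(360.0 / grid_step_deg))
    angles = [i * grid_step_deg for i in range(n)]
    return max(angles, key=lambda th: tuning_curve(th, *params))
```

Each term equals its amplitude at its own maximum angle and falls off more steeply as the width parameter k increases.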

The preferred orientation for each microglomerulus was the maximum of its 
fitted tuning curve. 

Orientation selectivity. The orientation selectivity index (OSI)“ was computed 
as the difference between the response in the preferred direction, ΔF/F_max, and the 
direction orthogonal to it (preferred direction + 90°), ΔF/F_ortho, normalized by the 
sum of the responses in the two directions: 

$$\mathrm{OSI} = \frac{\Delta F/F_{\mathrm{max}} - \Delta F/F_{\mathrm{ortho}}}{\Delta F/F_{\mathrm{max}} + \Delta F/F_{\mathrm{ortho}}}$$

Direction selectivity. The direction selectivity index” (DI) was calculated as 
the difference between responses in the preferred direction, ΔF/F_max, and anti- 
preferred direction (preferred direction + 180°), ΔF/F_opposite, normalized by their 
sum: 

$$\mathrm{DI} = \frac{\Delta F/F_{\mathrm{max}} - \Delta F/F_{\mathrm{opposite}}}{\Delta F/F_{\mathrm{max}} + \Delta F/F_{\mathrm{opposite}}}$$

The sign of the DI was defined as positive for front-to-back movement and 
negative for back-to-front movement. 
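
Both indices share the same normalized-difference form, which the following Python sketch makes explicit. The function names and the dictionary that maps stimulus angle (in degrees) to response are hypothetical, and the sign convention for front-to-back versus back-to-front movement is not implemented here.

```python
def selectivity_index(resp_preferred, resp_other):
    """Normalized difference (preferred - other) / (preferred + other)."""
    return (resp_preferred - resp_other) / (resp_preferred + resp_other)


def orientation_selectivity(responses, preferred_deg):
    """OSI: compare the preferred angle with the angle 90 degrees away."""
    ortho = (preferred_deg + 90.0) % 360.0
    return selectivity_index(responses[preferred_deg], responses[ortho])


def direction_selectivity(responses, preferred_deg):
    """Unsigned DI: compare the preferred angle with the angle 180 degrees away."""
    opposite = (preferred_deg + 180.0) % 360.0
    return selectivity_index(responses[preferred_deg], responses[opposite])
```

For non-negative responses, an index of 0 means equal responses to the two stimuli and an index of 1 means no response to the non-preferred stimulus.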

White-noise-based receptive fields and response predictions. RFs were reconstructed 
by thresholding calcium traces at 30% ΔF/F and averaging over all frames 
that induced a response larger than this threshold, weighted by peak calcium response. 
Only the rising slope of the calcium response was considered. The displayed 
RFs are the weighted averages of stimulus frames and the mean value (background) 
is subtracted. Max is the maximum of the weighted average after subtracting the 
mean value, and min is the minimum of this average. The mean value is set to 0. 
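
The reconstruction amounts to a response-weighted average of stimulus frames, sketched below in Python. The names are hypothetical, frames are nested lists with 1 for bright and 0 for dark squares, and ΔF/F is assumed to be expressed as a fraction, so that 30% corresponds to 0.30. Extracting the peak response from the rising slope of each trace is not shown.

```python
def white_noise_rf(frames, peak_responses, threshold=0.30):
    """Mean-subtracted, response-weighted average of the stimulus frames.

    frames: list of 2D lists (1 = bright square, 0 = dark square).
    peak_responses: peak dF/F evoked by each frame, as a fraction.
    """
    selected = [(f, r) for f, r in zip(frames, peak_responses) if r > threshold]
    if not selected:
        raise ValueError("no frame evoked a supra-threshold response")
    total_weight = sum(r for _, r in selected)
    h, w = len(frames[0]), len(frames[0][0])
    rf = [[sum(r * f[y][x] for f, r in selected) / total_weight
           for x in range(w)] for y in range(h)]
    mean = sum(sum(row) for row in rf) / (h * w)
    return [[v - mean for v in row] for row in rf]  # mean is now 0
```

After mean subtraction, squares that were bright more often than average in response-evoking frames are positive (putative excitatory subfields) and squares that were bright less often than average are negative.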

To predict responses to oriented bars (Fig. 2), we multiplied the mean-sub- 
tracted white-noise-based RF with stimulus values in each pixel and summed over 
all pixels. We convolved the result with a calcium response function. The ampli- 
tude and time constants of the calcium response function were fitted using res- 
ponses to bars of one orientation and then used in the prediction of responses to 
the remaining three orientations. 

The calcium response function, crf, used in predicting responses to oriented 
stimuli and in the analysis of calcium responses recorded during behaviour (see 
below) is given by: 


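$$\mathrm{crf} \sim \left(1 - e^{-(t-t_0)/t_{\mathrm{on}}}\right) e^{-(t-t_0)/t_{\mathrm{off}}}$$

where $t_0$ is the onset time and $t_{\mathrm{on}}$ and $t_{\mathrm{off}}$ are the rise and decay times of the calcium 
indicator. 
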
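
A calcium response function with an exponential rise and an exponential decay, and its use in predicting a response trace, can be sketched in Python as follows. This is an illustrative sketch only: the function names are hypothetical, the response is assumed to be zero before the onset time, and the time constants in any usage are placeholders rather than fitted values.

```python
import math


def crf(t, t0, t_on, t_off):
    """Calcium response function: exponential rise times exponential decay."""
    if t < t0:
        return 0.0  # assumption: no response before onset
    dt = t - t0
    return (1.0 - math.exp(-dt / t_on)) * math.exp(-dt / t_off)


def predict_response(drive, frame_dt, t_on, t_off, amplitude=1.0):
    """Causal discrete convolution of a per-frame drive with the crf kernel.

    drive: per-frame sum over pixels of RF value times stimulus value.
    """
    kernel = [crf(i * frame_dt, 0.0, t_on, t_off) for i in range(len(drive))]
    return [amplitude * sum(drive[m] * kernel[n - m] for m in range(n + 1))
            for n in range(len(drive))]
```

With a frame rate of 6.7 Hz, `frame_dt` would be about 0.15 s.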
Walking behaviour analysis. Ball movement was recorded with a sampling rate of 
4kHz and velocities were calculated with a time base of 250 ms. Velocities were 
then averaged over all velocity values in each two-photon imaging frame. Average 
velocities were calculated by detecting epochs in which the fly was moving and 
then averaged over this period. Owing to strong walking activity in behaving 
animals, there were few recordings that allowed us to compare microglomerular 
responses in walking and stationary conditions (Extended Data Fig. 10e-n). 
Flight behaviour analysis. The wing angle was detected by first manually iden- 
tifying the wing hinge in both wings. The wing was then detected by first sub- 
tracting the background recorded while the fly was not flying, smoothing with a 
mean filter, and setting a threshold for detecting the wings in two ROIs surround- 
ing the wings. The wing angle was then defined as the angle between the wing 
hinge and the tip of the wing. Flight was typically intermittent (Extended Data Fig. 
10o, p). 

Model for fitting responses of visual microglomeruli during behaviour. For 
describing the calcium responses of visual microglomeruli during walking beha- 
viour we fitted the responses with a model consisting of a single excitatory and 
inhibitory Gaussian function. Model fitting was only performed for microglomer- 
uli that responded with a single peak during one passage of the stimulus on the 
display, not for bilateral receptive fields. The initial position of each Gaussian, the 
(common) standard deviation and the amplitude were used as fit parameters. 


©2013 Macmillan Publishers Limited. All rights reserved 


Since the inhibitory responses could not be directly observed in our calcium 
signals, we set all negative values to zero. 


$$\mathrm{rf} \sim e^{-(x-x_0)^2/(2\sigma^2)} - e^{-(x-x_1)^2/(2\sigma^2)}$$

rf = 0 if rf < 0; $x_0$ and $x_1$ are the locations of the excitatory and inhibitory receptive 
fields, respectively, and $\sigma$ is the width of the receptive field. The calcium signal s 
(ΔF/F) was then modelled as the convolution of the receptive field with the 
calcium response function, crf, and a constant offset, bg: 

$$s \sim \mathrm{crf} \otimes \mathrm{rf} + \mathrm{bg}$$


Because the response in flying flies depended on the state of the fly, we used a 
model for the receptive field that additionally included the left and right wing 
angles as parameters: 


$$\mathrm{rf}_{\mathrm{flying}} \sim \mathrm{behaviour} \cdot \left(e^{-(x-x_0)^2/(2\sigma^2)} - e^{-(x-x_1)^2/(2\sigma^2)}\right)$$

rf = 0 if rf < 0; behaviour = a·lw + b·rw + c depends linearly, with the parameters a, 
b and c, on the left wing angle lw and the right wing angle rw. 

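
The receptive-field part of the behavioural model can be sketched in Python as a rectified difference of two Gaussians with a common width, optionally scaled by a gain that depends linearly on the wing angles. The function names and all numerical values in any usage are hypothetical, and the subsequent convolution with the calcium response function and the constant offset are omitted.

```python
import math


def rectified_rf(x, x0, x1, sigma, gain=1.0):
    """Excitatory minus inhibitory Gaussian with negative values set to zero."""
    g_exc = math.exp(-(x - x0) ** 2 / (2.0 * sigma ** 2))
    g_inh = math.exp(-(x - x1) ** 2 / (2.0 * sigma ** 2))
    return max(0.0, gain * (g_exc - g_inh))


def behaviour_gain(lw, rw, a, b, c):
    """Linear dependence of the response gain on left and right wing angles."""
    return a * lw + b * rw + c
```

For the walking model the gain stays at 1; for flying flies, `behaviour_gain` would be evaluated for each imaging frame and passed in as `gain`.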
Extended Data Figure 1 | Pan-neuronal receptive fields for LTR 
microglomeruli. a, Selected two-photon imaging frames showing calcium 
response in the LTR with pan-neuronal expression of GCaMP5. Highlighted in 
red are the microglomeruli that were selected for receptive field reconstruction. 
Scale bar, 5 µm. b, Schematic of visual stimulation used, receptive fields (two- 
trial averages) measured with left- and right-moving visual stimuli and their 
intersection labelled at full width at half maximum (FWHM) intensity (white 
ellipse) with the weighted centroid (white asterisk) of the distribution. c, Centre 
of each receptive field plotted against the centre of its corresponding 
microglomerulus. d, Correlation coefficient of data in c (r = 0.39, P = 0) and 
correlation coefficient of same data after randomly permuting the 
microglomeruli (n = 3,000 permutations, mean = (4.6 ± 0.076) × 10°). 
e, Principal component axes of receptive field centres (see Methods). 

Extended Data Figure 2 | Ring neurons make dendritic projections to LTR 
microglomeruli and send axonal arbors to the rings of the ellipsoid body. 
a, A frame of a confocal stack of the LTR labelled with antibody staining against 
GFP using a pan-neuronal driver line, R57C10-LexA and antibody staining 
against red fluorescent protein (RFP) of EB1-GAL4. d, dorsal; m, medial. b, A 
section of a confocal stack of the LTR labelled with GFP using a pan-neuronal 
driver line R57C10-LexA and RFP labelling of c232-GAL4. c, Multicolour FLP- 
out labelling of c232-GAL4 (see Methods). d, Antibody staining of c232-GAL4 
expressing the dendritic marker, DenMark (red) and an axonal label, 
synaptotagmin-tagged GFP (green). Entire cell is outlined with membrane- 
tagged HA (blue) (see Methods). Top panel is merge of all colours. e, Same as 
d for EB1-GAL4. 

Extended Data Figure 3 | Ring neurons respond with strong calcium 
transients to single stationary bright dots; receptive fields of ring neuron 
populations are spread non-uniformly across the largely ipsilateral visual 
field. a, Overlay of all regions of interest of R4d microglomeruli recorded in a 
fly during visual stimulation over an average of all frames in a calcium video 
(940 frames), and selected frames of the calcium video showing responses of 
individual microglomeruli. Scale bar, 5 µm. b, An example of the single dot 
stimulus appearing at the specified azimuth and elevation for 1 s followed by a 
dark period of 1 s (coloured dot indicates both the stimulus and the subsequent 
off period) (top). ΔF/F for microglomerulus 3 in a responding to the stimulus 
(bottom). c, Excitatory receptive fields for all microglomeruli shown in 
a. d, Overlay of all regions of interest selected among R2 microglomeruli 
recorded during visual stimulation over an average of all frames (n = 940) in a 
calcium video, and selected frames of the calcium video showing responses of 
individual microglomeruli. Scale bar, 5 µm. e, Azimuth and elevation of the 
single bright dot presented for 1 s followed by a 1 s interval without stimulus 
(top). ΔF/F for microglomerulus 1 in d responding to the visual stimulus 
(bottom). f, Excitatory receptive fields for all microglomeruli shown in 
d. g, Overlapping receptive fields of R4d neurons. Ellipses have the same 
normalized second central moments as the receptive fields shown in 
c thresholded at 30% ΔF/F. Intensity scale indicates how many receptive fields 
overlap in a given region. h, Same as g for R2 neurons shown in f. i-k, Average 
receptive fields measured across all flies with single dot stimulation for R4d 
neurons (green, n = 8 flies, 159 receptive fields) (i), for R2 neurons (purple, 
n = 12 flies, 219 RFs) (j) and for both (overlay) (k). Recordings from the right 
lateral triangle were flipped to the contralateral side for averaging. Although a 
large fraction of R2 and R4d ring neurons we imaged were visually responsive, 
there were ring neurons that did not respond in our experiments. Such neurons 
may have receptive fields in areas of the fly’s visual field that we did not sample 
(for example, very lateral areas), be tuned to visual dimensions that we did not 
explore (such as colour or polarization or complex shapes), or be selectively 
responsive to other sensory modalities (such as mechanosensation or 
thermosensation). 

Extended Data Figure 4 | Ring neuron orientation preference and 
selectivity, direction selectivity and receptive field centre positions. 
a, Correlation coefficients between predicted and measured responses for R2 
neurons (histogram over all trials and receptive fields) (n = 6 flies). b, Same as 
a, for R4d neurons (n = 6 flies). c, Microglomeruli of R4d neurons coloured 
according to orientation preference and orientation selectivity (six-trial 
averages). d, Azimuth and elevation of the centre of the excitatory receptive 
fields of the neurons shown in c (two-trial averages, measured using moving 
bars as described in Extended Data Fig. 1 and Methods). e, Direction selectivity 
for neurons in c (six-trial averages). f, g, Microglomeruli shown in Fig. 2k, l 
labelled according to the azimuth and elevation of their excitatory receptive 
field centre (measured using moving stimuli as in Extended Data Fig. 1) (f) and 
their orientation selectivity (four-trial averages) (g). h-p, Distributions of 
preferred orientation, orientation selectivity and direction index for R2 (n = 10 
flies), R4d (n = 12) and pan-neuronal (n = 11 flies) microglomeruli. 

Extended Data Figure 5 | Comparison of orientation tuning curves 
measured using dark-on-bright and bright-on-dark bars. a, Orientation 
tuning curves (GCaMP6s) of three R4d microglomeruli measured using (top 
row) bright bars on a dark background (as in Fig. 2f) and dark bars on a bright 
background (exact contrast inversion of bright-bar stimulus). As suggested by 
excitatory–inhibitory structure of receptive fields (Fig. 2c, Extended Data Fig. 
8), tuning curves measured with the two stimuli are qualitatively similar, but 
bright bars produce stronger responses (with our low-contrast, filtered LED 
array display, see Methods). b, Fitted tuning curves measured using dark and 
bright bars are highly correlated (mean = 0.648 ± 0.26, n = 7 flies). 




[Figure: R4d microglomerular receptive fields for fly 1 and for different lateral triangles/flies; axes, azimuth (°) and elevation (°). Histogram: number of pairwise comparisons against cross-correlation coefficient.] 

Extended Data Figure 6 | Stereotypy of excitatory receptive fields across 
flies in R4d neurons. a, Excitatory receptive fields measured using single dot 
stimulation in one fly (fly 1), and receptive fields recorded in other flies (n = 12) 
aligned by similarity. b, Cross-correlation coefficient between fly 1 and all other 
receptive fields in each column (mean = 0.74 ± 0.2). 

Extended Data Figure 7 | Stereotypy of excitatory receptive fields across 
flies in R2 neurons; summary of receptive field properties. a, Same as 
Extended Data Fig. 6a, measured in R2 neurons in 8 flies. b, Cross-correlation 
coefficient between fly 1 and all other receptive fields in each column for R2 
neurons shown in a (mean = 0.75 ± 0.12). c, Table showing characteristics of 
excitatory receptive fields for R4d and R2 ring neurons measured with single 
dot stimulation calculated at half maximum ΔF/F. The retinotopic correlation 
coefficients are significantly different from the control distributions, with 
P = 0.0283 for R4d (n = 8 flies, 159 receptive fields) and P = 0.0069 for R2 
(n = 12 flies, 219 receptive fields). 

Extended Data Figure 8 | Stereotypy of white-noise-based receptive fields 
across flies in R2 neurons. a, Receptive fields measured using white-noise 
stimulation in 6 flies aligned according to similarity. We used an average of 
41 ± 13 trials (6 flies, number of trials: 38, 25, 39, 63, 35 and 47). Each trial 
consisted of presentation of 60 stimulus frames (see Methods). b, Receptive 
fields measured using white-noise stimulation in 7 flies aligned by similarity 
(subset of this data are shown in Fig. 3). We used an average of 33 ± 20 trials 
(7 flies, number of trials: 59, 61, 29, 21, 21, 14 and 26). 

Extended Data Figure 9 | Ring neuron visual responses in the LTR are 
diminished during flight but not walking. a, Microglomeruli selected by 
visually assisted k-means clustering in R2 neurons, a subset of which are shown 
in Fig. 4b. b, Average over n = 24 trials of responses of R2 microglomeruli 
to a bright bar (15° × 90°) moving at 30° s⁻¹ shown during a period when the 
fly is walking more than 50% of the time (blue) and less than 50% of the time 
(red) during rightward moving stimulation (envelopes show s.d.). 
c, d, Correlation coefficient (mean = 0.72 ± 0.25) between model 
(see Methods) and data for R2 microglomeruli recorded in n = 8 flies (c), and 
for R4d microglomeruli (n = 14 flies, mean = 0.7 ± 0.25) that have unilateral 
receptive fields (d). e, Concatenation of seven epochs during walking behaviour 
with EB1-GAL4, UAS-GCaMP3 flies during stimulation with a left and right 
moving bar: data (blue) and model (red) fit (see Methods). f, Microglomeruli in 
R4d selected using visually assisted k-means clustering. g, Visual responses 
during flying and non-flying behaviour during stimulation with a bar moving 
from left to right (averaged over n = 14 trials for flying, n = 14 trials for 
non-flying), and during stimulation with a bar moving from right to left (n = 17 
for flying and n = 11 trials for non-flying) for the fly shown in Fig. 4e (envelopes 
show s.d.). h, Correlation coefficient between model (see Methods) and data 
for R4d neurons with a linear dependence of the visual response on both wing 
angles (mean = 0.65 ± 0.27) recorded in n = 14 flies. 

Extended Data Figure 10 | Selection of microglomeruli using visually
guided k-means clustering; walking and flight activity during behaving fly
experiments. a, Cross-correlation matrix between all pixels above threshold
(see Methods for details). b, Cross-correlation matrix sorted according to
clusters found by k-means clustering. c, Resulting regions of interest. d, Overlay
of resulting regions of interest and selected frames of calcium video. Regions of
interest that do not correspond to microglomeruli (based on comparison with
the calcium video) are removed. e-g, Velocities measured: forward (e), side
(f) and rotational (g) for all c232-GAL4, UAS-GCaMP3 flies recorded during
walking behaviour. h, Percentage of walking for all EB1-GAL4, UAS-GCaMP3
flies recorded. i-k, Velocities forward (i), sidewise (j) and rotational
(k). l, Percentage of walking for all EB1-GAL4, UAS-GCaMP3 flies recorded
during walking behaviour. m, Number of 130 s epochs recorded for each
c232-GAL4, UAS-GCaMP3 walking fly. n, Number of 130 s epochs recorded for
each EB1-GAL4, UAS-GCaMP3 walking fly. o, Percentage of time flying during
recording for all c232-GAL4, UAS-GCaMP3 flies recorded during flight
behaviour. p, Number of 130 s epochs recorded for each c232-GAL4,
UAS-GCaMP3 flying fly.
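
The clustering steps named in panels a-c can be illustrated with a minimal sketch. It assumes that each above-threshold pixel is described by its row of the pixel-by-pixel correlation matrix and that those rows are grouped by k-means; the pixel traces, the number of clusters and the farthest-point initialisation are hypothetical choices made for illustration, and the manual, visually guided removal of regions described in panel d is not modelled.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length traces."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def correlation_matrix(traces):
    """Pixel-by-pixel correlation matrix (one row per pixel)."""
    return [[pearson(a, b) for b in traces] for a in traces]

def sqdist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def kmeans(rows, k, n_iter=20):
    """Plain Lloyd iterations with a deterministic farthest-point start."""
    centres = [rows[0]]
    while len(centres) < k:
        centres.append(max(rows, key=lambda r: min(sqdist(r, c) for c in centres)))
    labels = [0] * len(rows)
    for _ in range(n_iter):
        # Assign every row to its nearest centre, then recompute the centres.
        labels = [min(range(k), key=lambda j: sqdist(r, centres[j])) for r in rows]
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if members:
                centres[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Hypothetical fluorescence traces: two groups of pixels with distinct time courses.
base_a = [0, 1, 0, 1, 0, 1, 0, 1]
base_b = [0, 0, 1, 1, 0, 0, 1, 1]
pixels = [[s * v for v in base_a] for s in (1.0, 2.0, 3.0)]
pixels += [[s * v for v in base_b] for s in (1.0, 2.0, 3.0)]

corr = correlation_matrix(pixels)
labels = kmeans(corr, k=2)

# Sorting rows and columns by cluster label gives the block structure of panel b.
order = sorted(range(len(labels)), key=lambda i: labels[i])
sorted_corr = [[corr[i][j] for j in order] for i in order]
```

Pixels that share a label would form one candidate region of interest; in the experiments, candidates were additionally checked against the calcium video.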
SHANK3 and IGF1 restore synaptic deficits in
neurons from 22q13 deletion syndrome patients


Aleksandr Shcheglovitov, Olesya Shcheglovitova, Masayuki Yazawa, Thomas Portmann, Rui Shu, Vittorio Sebastiano,
Anna Krawisz, Wendy Froehlich, Jonathan A. Bernstein, Joachim F. Hallmayer & Ricardo E. Dolmetsch


Phelan-McDermid syndrome (PMDS) is a complex neurodevelop- 
mental disorder characterized by global developmental delay, severely 
impaired speech, intellectual disability, and an increased risk of autism 
spectrum disorders (ASDs). PMDS is caused by heterozygous deletions
of chromosome 22q13.3. Among the genes in the deleted
region is SHANK3, which encodes a protein in the postsynaptic
density (PSD). Rare mutations in SHANK3 have been associated
with idiopathic ASDs, non-syndromic intellectual disability, and
schizophrenia. Although SHANK3 is considered to be the most
likely candidate gene for the neurological abnormalities in PMDS
patients, the cellular and molecular phenotypes associated with this
syndrome in human neurons are unknown. We generated induced 
pluripotent stem (iPS) cells from individuals with PMDS and autism 
and used them to produce functional neurons. We show that PMDS 
neurons have reduced SHANK3 expression and major defects in 
excitatory, but not inhibitory, synaptic transmission. Excitatory 
synaptic transmission in PMDS neurons can be corrected by restor- 
ing SHANK3 expression or by treating neurons with insulin-like 
growth factor 1 (IGF1). IGF1 treatment promotes formation of 
mature excitatory synapses that lack SHANK3 but contain PSD95 
and N-methyl-D-aspartate (NMDA) receptors with fast deactivation
kinetics. Our findings provide direct evidence for a disruption in the 
ratio of cellular excitation and inhibition in PMDS neurons, and 
point to a molecular pathway that can be recruited to restore it. 
To study cellular and molecular phenotypes associated with PMDS 
in humans, we generated six iPS cell lines from two patients with 
PMDS (Supplementary Table 1) that had deletions of approximately 
1 megabase in chromosome 22 (Supplementary Fig. 1 and Supplementary
Table 2) by using standard reprogramming techniques. As
controls, we used iPS cell and human embryonic stem (ES) cell lines
that were characterized in previous studies. Control and PMDS iPS
cell lines had typical human ES-cell-like morphologies and expressed 
endogenous pluripotency marker proteins, but had suppressed the 
expression of the exogenous reprogramming factors, had normal kar- 
yotypes, and formed teratomas in vivo (Supplementary Fig. 2 and 
Supplementary Table 3). We differentiated control and PMDS iPS cells 
into neurons using a modified monolayer differentiation protocol 
(Supplementary Fig. 3) and characterized the population of differen- 
tiated cells by quantitative PCR with reverse transcription (qRT-PCR) 
(Supplementary Fig. 4). Similar populations of neuronal cells were 
found in cultures of control and PMDS neurons. Analysis of gene 
expression with multiplex single-cell qRT-PCR demonstrated that
approximately 42% of cells expressed pan-neuronal markers, 15% 
expressed cortical layer markers, and only 2-5% of the cells expressed 
the synaptic markers SHANK1-SHANK3 (Supplementary Fig. 5), indi- 
cating that only a small fraction of the cells were synaptically mature 
neurons. To enrich for mature forebrain neurons, we infected cells with 
enhanced green fluorescent protein (GFP) under the control of the 
CaMKIIα promoter (CaMKIIα-GFP) (Fig. 1a). Single-cell profiling


of GFP-expressing cells showed an approximate 12-fold enrichment in 
cells expressing SHANK1-SHANK3 (Fig. 1b). These cells also expressed
synaptic markers and genes typically found in the cortex and striatum, 
including TBR1, CUX1, SATB2, SOX5, FOXP1, CTIP2 (also known as 
BCL11B) and FOXP2 (Supplementary Fig. 5) and were stained with 
antibodies recognizing MAP2 (~99%), TBR1 (~29%), CTIP2 (~66%), 
SATB2 (~42%), and γ-aminobutyric acid (GABA) or GAD67 (~7.5%)
(Fig. 1c-h and Supplementary Fig. 6). 

We next investigated the electrophysiological properties of control 
and PMDS neurons. Control and PMDS neurons expressing GFP had 
similar action potential characteristics, resting membrane potential 
and capacitance (Supplementary Table 4). PMDS neurons, however, 
had an increased input resistance and reduced amplitude and frequency
of spontaneous synaptic events (Fig. 1i-j and Supplementary
Table 4). A reduction in synaptic transmission in PMDS neurons 
could arise from differences in the number of neurons present in 
cultures of PMDS and control neurons. In fact, anti-MAP2 antibody 
staining revealed that our PMDS iPS cell lines produced slightly fewer 
neurons than controls (Supplementary Fig. 7). To control for this 
difference in neuronal density and to study cells of both genotypes 
under identical experimental conditions, we infected control and PMDS 
neurons with either CaMKIIα-GFP or CaMKIIα-mKate2 and grew
them together in the same dish (Fig. 2a). Under these conditions, we 
found that the amplitude and frequency of spontaneous excitatory post- 
synaptic currents (EPSCs) recorded from PMDS neurons were signifi- 
cantly reduced relative to controls (Fig. 2b, c), indicating that synaptic 
defects in PMDS neurons are postsynaptic and cell-autonomous. To 
determine if the changes in spontaneous EPSCs were also reflected in 
differences in evoked α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic
acid (AMPA) and NMDA-mediated synaptic responses, we measured
currents induced by local field stimulation at −70 mV (AMPA-EPSCs)
and +60 mV (NMDA-EPSCs). Smaller AMPA- and NMDA-EPSCs
were measured in PMDS neurons in response to different stimulus intensities
(Fig. 2d-f), indicating that both AMPA- and NMDA-mediated synaptic
transmission are significantly impaired.
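
The two amplitude measures used for evoked currents (AMPA-EPSCs read at their peak at −70 mV, NMDA-EPSCs read 50 ms after the stimulus at +60 mV, as stated in the legend of Fig. 2) can be sketched as follows. The 10 kHz sampling rate matches the digitization rate given in the Methods for evoked currents; the synthetic traces, the 50-sample baseline window and the function names are assumptions introduced only for illustration.

```python
import math

FS_HZ = 10_000  # evoked currents were digitized at 10 kHz (Methods)

def baseline(trace, stim_idx, n_pre=50):
    """Mean current over the n_pre samples preceding the stimulus (assumed window)."""
    pre = trace[max(0, stim_idx - n_pre):stim_idx]
    return sum(pre) / len(pre)

def ampa_peak_amplitude(trace, stim_idx):
    """Largest baseline-subtracted deflection after the stimulus (trace held at -70 mV)."""
    base = baseline(trace, stim_idx)
    return max(abs(i - base) for i in trace[stim_idx:])

def nmda_amplitude_50ms(trace, stim_idx, fs_hz=FS_HZ, delay_s=0.050):
    """Baseline-subtracted current 50 ms after the stimulus (trace held at +60 mV)."""
    base = baseline(trace, stim_idx)
    idx = stim_idx + int(round(delay_s * fs_hz))
    return trace[idx] - base

# Hypothetical traces in pA, with the stimulus delivered at sample 500.
stim = 500
ampa_trace = [-5.0] * 2000
for k in range(100):            # fast inward current, largest at the first sample
    ampa_trace[stim + k] = -5.0 - 100.0 * math.exp(-k / 30.0)

nmda_trace = [2.0] * 2000
for k in range(1500):           # slow outward current, 100 ms time constant
    nmda_trace[stim + k] = 2.0 + 80.0 * math.exp(-k / 1000.0)

ampa_amp = ampa_peak_amplitude(ampa_trace, stim)
nmda_amp = nmda_amplitude_50ms(nmda_trace, stim)
```

Reading the NMDA component at a fixed delay rather than at the peak is a common way to limit contamination by the faster AMPA component; whether that was the authors' rationale is not stated here.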

To determine if inhibitory synaptic transmission was also altered in 
PMDS neurons, we recorded spontaneous and evoked inhibitory post- 
synaptic currents (IPSCs) (Fig. 2g-j). There were no differences in 
IPSCs measured from co-cultured control and PMDS neurons. To 
determine the direction of the voltage shifts induced by IPSCs at this 
developmental stage, we measured voltage deflections induced by focal 
γ-aminobutyric acid application (Supplementary Fig. 8). Most control
and PMDS neurons responded with hyperpolarizing shifts in mem- 
brane potential. These results indicate that the defects in PMDS neu- 
rons are limited to excitatory synaptic transmission and suggest that 
there is an alteration in the balance of cellular excitation and inhibition 
in PMDS neurons. 

To gain insight into the mechanism underlying the impairment of 
excitatory synaptic transmission in PMDS neurons, we measured the 
Figure 1 | CaMKIIα-GFP labels functional iPS cell-derived forebrain
neurons that express SHANK3. a, Image of iPS cell-derived neurons labelled
with EGFP under the control of the CaMKIIα promoter. b, Fraction of cells
expressing SHANK1-3, selected either randomly (unlabelled, n = 1 line (total
90 cells)) or based on CaMKIIα-GFP expression (CaMKIIα-GFP⁺, n = 8 (90))
and assessed by multiplex single-cell qRT-PCR (Supplementary Fig. 6). h18S,
human 18S ribosomal RNA. c-g, Images of iPS cell-derived neurons
immunostained with antibodies against GFP and MAP2 (c), TBR1 (d), CTIP2
(e), SATB2 (f), and GAD67 (g). h, Fraction of cells expressing MAP2 (n = 3
cover slips (682 cells)), TBR1 (n = 4 (566)), CTIP2 (n = 8 (853)), SATB2 (n = 6
(646)), and γ-aminobutyric acid (GABA)/GAD67 (n = 5 (493)) among GFP-expressing
cells. i, Representative recordings of action potentials (left, induced
by somatic current injections, ΔI = 5 pA) and input resistance measurements
(right) obtained from GFP-expressing control (n = 22 cells from 3 lines) and
PMDS neurons (n = 36 cells from 5 lines). j, Representative recordings (left)
and quantification of amplitude and frequency (right) of spontaneous EPSCs
obtained from GFP-expressing control (n = 19 cells from 3 lines) and PMDS
neurons (n = 26 cells from 5 lines) (Supplementary Table 4). Data presented as
means ± s.e.m.; ***P < 0.001, Mann-Whitney test. Scale bars, 50 μm.


expression levels of AMPA and NMDA receptors. PMDS neurons 
expressed significantly less GLUA1 and GLUN1 proteins (Fig. 3a) 
and generated significantly smaller currents in response to the focal 
application of either AMPA (Fig. 3b) or NMDA (Fig. 3c). In contrast, 
the amplitude of γ-aminobutyric-acid-evoked currents was similar for
both control and PMDS neurons (Supplementary Fig. 9), indicating 
that PMDS neurons express a decreased number of excitatory neuro- 
transmitter receptors and normal levels of inhibitory receptors. 

To determine whether the impairment in excitatory synaptic trans- 
mission in PMDS neurons is due to a reduced number of synapses, 
we immunostained co-cultured control and PMDS neurons with anti- 
bodies that recognize the presynaptic protein synapsin 1 (SYN1) and 
postsynaptic protein HOMER1 (Fig. 3d). We observed a significant
decrease in the number of puncta that were SYN1⁺ (Fig. 3e) and
HOMER1⁺ SYN1⁺ (Fig. 3f) in PMDS neurons as compared to controls,
indicating that PMDS neurons have significantly fewer excitatory
synapses than controls.
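
The puncta counts are compared as cumulative distributions with a Kolmogorov-Smirnov test (legend of Fig. 3). As a minimal sketch of what that test measures, the two-sample statistic D is the largest vertical gap between the two empirical cumulative distributions. The puncta counts below are hypothetical and the P value is not computed; in practice a statistics library would be used for that.

```python
def ecdf(sample, x):
    """Empirical cumulative distribution: fraction of the sample that is <= x."""
    return sum(1 for v in sample if v <= x) / len(sample)

def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic D = max |F_a(x) - F_b(x)|."""
    points = sorted(set(a) | set(b))
    return max(abs(ecdf(a, x) - ecdf(b, x)) for x in points)

# Hypothetical numbers of puncta per neuron (not data from the paper).
control_puncta = [8, 10, 12, 14]
pmds_puncta = [4, 6, 8, 10]

d = ks_statistic(control_puncta, pmds_puncta)
```

A larger D means the two cumulative curves sit further apart, which is what a leftward shift of the PMDS curve relative to the control curve would produce.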
Figure 2 | PMDS neurons display impaired excitatory synaptic
transmission. a, Images of control (green) and PMDS (red) neurons co-cultured
on a bed of rat cortical astrocytes (scale bar, 50 μm), top. Traces of
spontaneous EPSCs recorded at −70 mV, bottom. b, c, Cumulative distribution
and quantification of amplitude (b) and frequency (c) of spontaneous EPSCs
(n = 22 control and 16 PMDS cells). d, Traces of evoked AMPA-
(Vh = −70 mV) and NMDA-EPSCs (Vh = +60 mV). Arrows indicate the time
of stimulation. Stimulation artefacts were blanked. e, f, Input-output curves of
evoked AMPA- (measured at the peak; n = 15 control and 23 PMDS cells) and
NMDA-EPSCs (measured 50 ms post-stimulus; n = 14 control and 20 PMDS
cells) recorded in response to different stimulus intensities. Bar graphs show
quantification of maximum current amplitude. g, Traces of spontaneous IPSCs
recorded at −70 mV. h, i, Cumulative distribution and quantification of
amplitude (h) and frequency (i) of spontaneous IPSCs (n = 21 control and 23
PMDS cells). j, Representative traces and input-output curves of evoked IPSCs
(n = 17 control and 16 PMDS cells). Data presented as means ± s.e.m.;
*P < 0.05, **P < 0.01, Mann-Whitney test.


Although SHANK3 has been proposed to be primarily responsible 
for the neurological phenotype in PMDS patients, the vast majority of 
PMDS patients have 10-100 additional genes deleted. To determine 
whether the synaptic phenotypes in PMDS neurons are due to reduced 
SHANK3 expression, we measured the levels of SHANK3 messenger 
RNA and protein. Whereas MAP2, SHANK1 and SHANK2 were 
expressed at similar levels in control and PMDS neurons (Fig. 4a), 
SHANK3 mRNA was significantly reduced in PMDS (Fig. 4a). 
Western blot analysis showed a significant reduction in the expression 
of the longest SHANK3 protein isoform (Fig. 4b), which was expressed 
primarily in neurons (Supplementary Fig. 10). Immunostaining with 
anti-SHANK3 antibodies on co-cultured control and PMDS neurons 
also revealed a significantly reduced level of SHANK3 protein expression 
in PMDS neurons (Fig. 4c), indicating that loss of a copy of SHANK3 in 
PMDS significantly downregulates the expression of the protein. 

We next investigated whether we could reverse the synaptic pheno- 
type in PMDS neurons by increasing SHANK3 expression. We infected 
PMDS neurons with a lentivirus containing the longest isoform of 
SHANK3 fused to EGFP (Fig. 4d and Supplementary Fig. 11). SHANK3 
expression in PMDS neurons restored both the amplitude and fre- 
quency of spontaneous EPSCs (Fig. 4e-g). It also completely rescued 
evoked AMPA- and NMDA-EPSCs in 43% of recorded cells, and in 
Figure 3 | PMDS neurons show reduced expression of glutamate receptors
and decreased number of synapses. a, Quantification of expression of AMPA
and NMDA receptors in control and PMDS neurons using western blot
(day in vitro 45-50; n = 3-11 pairs of samples; **P < 0.01, ***P < 0.001,
one-sample t-test). GAPDH was used as a loading control. b, c, Representative
currents (top) and peak current versus voltage curves (bottom) recorded in
control and PMDS neurons in response to focal application of 200 μM AMPA
(measured at a holding potential of −70 to +50 mV, Δ20 mV, n = 7 control
and 11 PMDS cells) and 100 μM NMDA/10 μM Gly (−60 to +60 mV,
Δ20 mV, n = 21 control and 30 PMDS cells). Data presented as means ± s.e.m.;
*P < 0.05, ***P < 0.001, Mann-Whitney test. d, Images of co-cultured control
(green) and PMDS (red) neurons immunostained for mKate2, GFP, SYN1
and HOMER1. Scale bars, 20 (left) and 5 μm (right). e, f, Cumulative
distributions of the number of SYN1⁺ (e) and HOMER1⁺ SYN1⁺ (f) puncta
on co-cultured control and PMDS neurons; *P < 0.05, ***P < 0.001,
Kolmogorov-Smirnov test.


the remaining cells, it rescued only AMPA-EPSCs (Fig. 4h-j). This 
indicates that loss of SHANK3 substantially contributes to the impaired 
excitatory synaptic transmission detected in PMDS neurons; however, 
it also suggests that SHANK3 may act differently on AMPA and 
NMDA receptors. 

We next tested if we could restore synaptic deficits in PMDS neurons 
pharmacologically, using agents that have been previously reported to 
increase the level of SHANK3 expression or to increase synaptic transmission,
including trichostatin A (TSA), valproic acid (VPA),
nifedipine (Nif), IGF1 and IGF2. We treated control and PMDS
neurons over the course of 2 to 20 days and measured the number
of HOMER1⁺ SYN1⁺ puncta. Only IGF1 increased the number of
SYN1⁺ HOMER1⁺ puncta to the levels observed in control neurons
in the same culture (Fig. 5a-b). We also found that IGF1 increased the
average amplitude and frequency of spontaneous EPSCs in PMDS 
neurons to control levels (Fig. 5c), but had no effect on spontaneous 
IPSCs (Supplementary Fig. 12). IGF1 also restored the amplitude of 
evoked AMPA- and NMDA-EPSCs (Fig. 5d) and reduced the input 
resistance of PMDS neurons to that observed in control neurons 
(Supplementary Fig. 13). Finally, IGF1 completely restored the 
Figure 4 | Reduced SHANK3 expression contributes to synaptic defects in
PMDS neurons. a, Relative expression of SHANK1, SHANK2, SHANK3 and
MAP2 mRNAs in iPS cell-derived neurons (day in vitro 30-35; n = 5 control
and 6 PMDS biological replicates; *P < 0.05, **P < 0.01, Student’s t-test).
ex, exon. b, Quantification of SHANK3 expression in control and PMDS
neurons as assessed by western blot (day in vitro 45-50; n = 4 pairs of samples;
**P < 0.01, one-sample t-test). GAPDH was used as a loading control.
c, Images (left) and quantification (right) of SHANK3 expression in co-cultured
control (red) and PMDS (green) neurons (scale bar, 20 μm). d, Images of
PMDS neurons infected with GFP-tagged SHANK3 lentiviruses and
immunostained with antibodies against MAP2, GFP and SYN1 (scale bars, 20
(top) and 5 (bottom) μm). e, Traces of spontaneous EPSCs recorded from
uninfected and SHANK3-infected PMDS neurons at −70 mV. f, g, Cumulative
distribution and quantification of amplitude (f) and frequency (g) of
spontaneous EPSCs (n = 8 uninfected and 17 SHANK3-infected cells).
h, Traces of evoked AMPA- (Vh = −70 mV) and NMDA-EPSCs
(Vh = +60 mV). i, j, Quantification of maximum amplitudes of evoked
AMPA- (i, measured at peak) and NMDA-EPSCs (j, measured 50 ms post
stimulus) recorded from uninfected (n = 6) and SHANK3-infected (n = 14)
PMDS neurons. Data presented as means ± s.e.m.; *P < 0.05, ***P < 0.001,
Mann-Whitney test.


whole-cell NMDA receptor currents evoked by focal application of 
NMDA, but failed to fully restore whole-cell AMPA receptor currents 
elicited by application of AMPA (Supplementary Fig. 14). Together, 
these results indicate that treatment with IGF1 restores synaptic trans- 
mission in PMDS neurons by increasing the number of synaptic 
AMPA and NMDA receptors. 

To gain insight into the mechanism of IGF1 action, we measured 
SHANK3 expression in control and PMDS neurons treated with IGF1. 
Surprisingly, we found that IGF1 caused a marked decrease of SHANK3 
protein expression in the cell body and processes of both control and 
PMDS neurons (Supplementary Fig. 15). We next examined SHANK3 
expression at synapses by infecting cells with GFP-tagged SHANK3 
and staining neurons with the postsynaptic antibody anti-PSD95 (Sup- 
plementary Fig. 16a). We found only 50% overlap between SHANK3- 
and PSD95-containing puncta in human iPS cell-derived neurons. A 
similar pattern of SHANK3 and PSD95 expression was detected in the 
human fetal brain tissue immunostained with anti-SHANK3 and anti- 
PSD95 antibodies (Supplementary Fig. 16b). This indicates that in 
developing neurons there are three populations of excitatory synapses: 
those that contain SHANK3 alone, those that contain SHANK3 and 
Figure 5 | IGF1 treatment restores excitatory
synaptic transmission in PMDS neurons.
a, Quantification of the number of
HOMER1⁺ SYN1⁺ puncta in co-cultured neurons
after treatment with different agents (w/o: n = 51
control, 43 PMDS cells; TSA: n = 11/12, VPA:
n = 25/29, Nif: n = 9/8, IGF2: n = 31/40, IGF1:
n = 35/32). w/o, without treatment. b, Images of
co-cultured control (green) and PMDS (red)
neurons after IGF1 treatment. Scale bars, 20 (top)
treated cultures at −70 mV (n = 18 control and
25 PMDS cells). d, Recordings (top) and
quantification of amplitudes (bottom) of evoked
AMPA- (measured at the peak, Vh = −70 mV) and
NMDA-EPSCs (measured 50 ms post-stimulus,
Vh = +60 mV) acquired in IGF1-treated cultures
(n = 14 control and 21 PMDS cells). Dashed lines
represent data originally presented in Fig. 2.
e, Images of co-cultured control (green) and PMDS
(red) neurons in untreated (top) and IGF1-treated
(bottom) cultures, immunostained with antibodies
PSD95, and those that contain PSD95 alone. We investigated the effect
of IGF1 on these three populations of synapses by staining neurons 
with anti-SHANK3 and anti-PSD95 antibodies (Fig. 5e). Treatment 
with IGF1 caused a decrease in the number of puncta expressing 
SHANK3 alone in control neurons, but had little effect in PMDS 
neurons, which already had reduced levels of SHANK3 (Fig. 5f). In 
contrast, IGF1 caused a 45% and 340% increase in the fraction of 
puncta expressing PSD95 in control and PMDS neurons, respectively, 
while decreasing the number of puncta containing both markers 
(Fig. 5f). To determine if synapses that lack SHANK3 are functionally 
different, we analysed the physiological properties of synapses in 
PMDS and control neurons. We found that PMDS neurons have a 
significantly faster rate of decay of NMDA-EPSCs relative to control 
cells and treatment with IGF1 increased the decay rate of NMDA- 
EPSCs in control neurons to match the rate of decay in PMDS neurons 
(Fig. 5g). In contrast, SHANK3 expression in PMDS neurons caused a 
decrease in the rate of decay of NMDA-EPSCs to match that in 
untreated control neurons (Fig. 5g). These results are consistent with 
the idea that IGF1 treatment rescues synaptic transmission in PMDS 
neurons by causing loss of SHANK3-containing synapses and gain of 
PSD95-containing synapses, which are characterized by rapidly decay- 
ing NMDA-EPSCs (Fig. 5h). 
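
The comparison of NMDA-EPSC decay can be made concrete with a small sketch. It assumes, purely for illustration, that the decaying phase is well described by a single exponential I(t) = A·exp(−t/τ), so that a faster decay corresponds to a smaller τ; the text above does not state how the decay rate was quantified, and the amplitudes and time constants below are hypothetical.

```python
import math

def decay_time_constant(times_ms, currents):
    """Least-squares line through ln(I) versus t; returns tau = -1/slope in ms."""
    logs = [math.log(i) for i in currents]
    n = len(times_ms)
    mean_t = sum(times_ms) / n
    mean_l = sum(logs) / n
    num = sum((t - mean_t) * (l - mean_l) for t, l in zip(times_ms, logs))
    den = sum((t - mean_t) ** 2 for t in times_ms)
    return -1.0 / (num / den)

# Hypothetical decay phases sampled every 10 ms (current in pA, positive at +60 mV).
times = [float(t) for t in range(0, 300, 10)]
slow = [50.0 * math.exp(-t / 150.0) for t in times]   # slowly decaying current
fast = [50.0 * math.exp(-t / 60.0) for t in times]    # rapidly decaying current

tau_slow = decay_time_constant(times, slow)
tau_fast = decay_time_constant(times, fast)
```

On real recordings the log transform requires baseline-subtracted, strictly positive currents, and a weighted or multi-exponential fit may describe the data better.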

Here, we show that neurons derived from iPS cells from PMDS 
patients have major defects in excitatory synaptic transmission arising 
from both a failure to form the correct number of excitatory synapses 
and a reduction in the expression of glutamate receptors. As no signi- 
ficant impairments in inhibitory synaptic transmission were found, 
this resulted in an imbalance between excitation and inhibition at a
cellular level. Only some aspects of the phenotype detected in human
cells are similar to those found in mouse models of PMDS. Why
there is a discrepancy is unclear. One potentially important issue is that 
there are multiple SHANK3 isoforms. PMDS patients generally lack 
the entire gene on one chromosome, whereas mice typically lack only 
some of the isoforms and have generally been studied as homozygote 
knockouts. We also demonstrated that SHANK3 and IGF1 largely
restore synaptic deficits in PMDS neurons. Interestingly, IGF1 decreases
SHANK3 expression and promotes the formation of a class of synapses
containing PSD95 but lacking SHANK3. One possibility is that SHANK3 
is required at an early stage of synapse formation, and that IGF1 pro- 
motes maturation of synapses by triggering the loss of SHANK3 and 
recruitment of PSD95. Consistent with this view, SHANK3-containing 
synapses have slowly decaying NMDA receptor currents that resemble 
those of GLUN2B receptors that occur early in development, whereas 
synapses that contain PSD95 but lack SHANK3 have rapidly decaying 
NMDA receptor currents that resemble the GLUN2A-containing synapses 
that appear later in development. In summary, we have identified
synaptic defects in iPS cell-derived neurons from patients with PMDS 
and we show that these defects depend on loss of SHANK3 and can be 
reversed by IGF1. 


METHODS SUMMARY 


Primary fibroblasts from consenting patients with PMDS were acquired by skin- 
punch biopsy following approved SCRO and IRB protocols. iPS cell lines from the 
fibroblasts were generated using standard reprogramming techniques. iPS cells
were differentiated into neurons using a modified monolayer differentiation protocol
(Supplementary Fig. 3). Properties of iPS cell-derived neurons were characterized
using qRT-PCR, multiplex single-cell qRT-PCR, electrophysiology,
immunocytochemistry and western blot. A detailed description of all the procedures,
materials and methods used in this study is available in the Methods section.
The experiments performed are also summarized in Supplementary Table 3.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Recruitment of patients with PMDS and skin biopsy procedure. Participants 
diagnosed with 22q13 deletion syndrome were recruited from on-going research 
studies, the medical genetics service at Lucile Packard Children’s Hospital, and 
interest groups for patients and families with genetic disorders associated with 
autistic spectrum manifestations. Consenting participants were evaluated with 
the Autism Diagnostic Observation Schedule (ADOS) and, when possible, the 
Stanford-Binet Intelligence Scales, Fifth Edition (SB5). In addition, parents of 
participants completed the Autism Diagnostic Interview-Revised (ADI-R) and 
questionnaires about the participant, including a medical history questionnaire 
and the Social Responsiveness Scale (SRS). All assessors were trained nationally 
and certified as reliable in the administration of ADOS and ADI-R assessments. 
Skin punch biopsies were performed as follows: after local anaesthesia with topical 
and injected xylocaine, a shallow, 3-mm punch biopsy sample was obtained from 
either the forearm or thigh. 

This study was approved by both the Stanford University Institutional Review 
Board (IRB) and the Stem Cell Research Oversight Committee (SCRO). 
Generation and characterization of iPS cell lines. Detailed procedures used for 
the generation, maintenance, and characterization of iPS cells were previously 
described. Briefly, primary fibroblasts from patients with PMDS were acquired
by skin-punch biopsy following approved SCRO and IRB protocols. Cells were 
propagated for 2-3 passages, plated at 100,000 cells per well in six-well plates, and 
infected with retroviruses carrying SOX2, OCT3/4, c-MYC and KLF4 transcription
factors the next day. iPS cell colonies, normally detected 21-30 days after the
first infection, were routinely maintained on irradiated CF1 feeder cells in iPS cell 
medium (DMEM/F12 (11330-032, Gibco), containing 20% Knockout SR (10828- 
028, Gibco), 1% MEM (11140, Gibco), 0.5% L-glutamine (25030, Gibco), 1% 
penicillin streptomycin (15140, Gibco), 0.1 mM B-mercaptoethanol (Sigma) and 
10 ng ml⁻¹ basic fibroblast growth factor (R&D Systems)).

iPS cells were immunostained with antibodies against NANOG (AF1997, R&D 
Systems) and TRA-2-49/6E (developed by P. W. Andrews and obtained from the 
Development Studies Hybridoma Bank developed under the auspices of the 
National Institute of Child Health and Development (NICHD) and maintained 
in the University of Iowa, Department of Biological Sciences). 

SKY analysis of all iPS cell lines was performed according to a standard procedure.
Briefly, metaphases were extracted after overnight incubation in 0.03 μg ml⁻¹
KaryoMAX Colcemid solution in iPS cell media. Chromosomes were isolated
and stained with the SKY Paint probe mixture and CAD kit (Applied Spectral
Imaging), following the manufacturer’s protocol. Spreads were imaged on a Leica
microscope (DM400B), equipped with SpectraCubeSD300 and a ×63 objective,
using Case Data Manager 6.0 and HiSKY 6.0 software (Applied Spectral Imaging).
Twenty spreads from each iPS cell line with well-separated chromosomes were 
imaged and analysed for visible chromosome abnormalities. 

For the teratoma formation assay, iPS cells from a confluent 10-cm dish were 
enzymatically collected, washed and resuspended in 1.5 ml of PBS. Cells were then 
injected into the kidney capsules of SCID mice (Charles River Laboratories 
International). Visible tumours were dissected 3-8 weeks post-transplantation and 
fixed in 4% paraformaldehyde. Fixed samples were sent to the Cureline Biopathology 
Laboratory for paraffin embedding, sectioning and staining with haematoxylin 
and eosin. Sections were then examined for the presence of tissue representatives of 
all three germ layers. 

Genomic DNA from fibroblasts and iPS cells was isolated using the DNeasy 
Blood & Tissue kit (Qiagen). Haploinsufficiency of SHANK3 in fibroblasts and all 
iPS cells of patients was verified using the SALSA MLPA kit (P188, MRC-Holland),
following the manufacturer’s protocol. 

Neural differentiation. For neural differentiation, a combination of two protocols 
was used. The rationale and detailed procedures are presented in Supplementary
Fig. 3.

qRT-PCR. qRT-PCR was performed in a real-time thermal cycler (Mastercycler 
ep realplex, Eppendorf) using either FastStart Universal SYBR Green (Rox) 
(Roche) or RT qPCR master mixes (SABiosciences). Total RNA was isolated using 
the RNeasy Mini kit (Qiagen). cDNA was synthesized with either the SuperScript III 
First-Strand Synthesis kit (Invitrogen) or the RT First Strand kit (SABiosciences). 
All primers are listed in Supplementary Table 5. 

Multiplex single-cell qRT-PCR. Single cells were collected either by aspiration 
with a patch glass pipette filled with 4 μl of sterile intracellular solution or by FACS
sorting using a BD influx sorter (BD Biosciences) into 10 μl of a pre-amplification
mix containing 40 nM of all primers for the genes of interest and the following
components of the CellsDirect One-Step qRT-PCR kit (Invitrogen): 2× Reaction
Mix, SuperScript III RT/Platinum Taq Mix. After collection, samples were
reverse-transcribed and pre-amplified for 18 cycles. Pre-amplified samples were diluted
(3×) with TE buffer and stored at −20 °C. Sample and assay (primer pairs) preparation
for 96.96 Fluidigm Dynamic arrays was done according to the manufacturer’s
recommendation. Briefly, each sample was mixed with 20× DNA binding dye
sample-loading reagent (Fluidigm), 20× EvaGreen (Biotium) and TaqMan Gene
Expression master mix (Applied Biosystems). Assays were mixed with 2× assay
loading reagent (Fluidigm) and TE to a final concentration of 5 μM. The 96.96
Fluidigm Dynamic Arrays (Fluidigm) were primed and loaded on an IFC Controller 
HX (Fluidigm) and quantitative PCR experiments were run on a Biomark System 
for Genetic Analysis (Fluidigm). Melting curves were used to determine the specificity
of each reaction. In addition to single-cell material, every experiment contained
four standard dilutions of a mixed human cDNA library. Data were collected
and analysed using Fluidigm Real-Time PCR Analysis software (v.2.1.3 and v.3.0.2).
Threshold cycle (Ct), as a measure of the original template amount, was used for
quantification. To compare the data acquired from different chips, the Ct values
were normalized to the median intensity of the entire chip. Because single-cell gene 
expression in mammals occurs in bursts and follows a binary distribution, we 
established 32 cycles (at least five standard deviations from the mean of the lowest 
expressed genes) as a general threshold for whether or not a gene is expressed. 
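
The expression call described above can be sketched in a few lines. The 32-cycle threshold is taken from the text; everything else is an assumption made for illustration: chip normalization is read here as shifting each chip's Ct values so that its median matches a common reference, a gene is called expressed when its normalized Ct falls below the threshold (a lower Ct meaning more template), and the Ct values themselves are hypothetical.

```python
from statistics import median

CT_THRESHOLD = 32.0  # cycles; the general expression threshold stated in the text

def normalise_chip(ct_values, reference_median):
    """Shift a chip's Ct values so its median equals the reference (assumed reading)."""
    shift = reference_median - median(ct_values)
    return [ct + shift for ct in ct_values]

def is_expressed(ct):
    """Binary call: below-threshold Ct is treated as expressed (assumed direction)."""
    return ct < CT_THRESHOLD

def fraction_expressing(ct_per_cell):
    """Fraction of cells in which the gene is called expressed."""
    calls = [is_expressed(ct) for ct in ct_per_cell]
    return sum(calls) / len(calls)

# Hypothetical Ct values for one gene measured in four cells on each of two chips.
chip_a = [24.0, 28.0, 35.0, 40.0]
chip_b = [26.0, 30.0, 37.0, 42.0]

reference = median(chip_a + chip_b)          # common reference across chips
norm_a = normalise_chip(chip_a, reference)
norm_b = normalise_chip(chip_b, reference)

frac_a = fraction_expressing(norm_a)
frac_b = fraction_expressing(norm_b)
```

Treating expression as a binary call per cell is what allows statements such as "approximately 42% of cells expressed pan-neuronal markers" to be computed as simple fractions.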
Electrophysiology. All recordings were performed at room temperature (22-25 °C) 
on GFP- or mKate2-expressing iPS cell-derived neurons on day 31-45 (separate 
cultures without astrocytes) or day 45-52 (co-cultured on rat cortical astrocytes). 
Briefly, neurons grown on cover slips or 35-mm plastic dishes were visualized with 
a ×40 air objective on an inverted Nikon microscope (Eclipse TE2000) and
recorded using an EPC10 amplifier (HEKA). The following solutions were used 
(in mM): extracellular, 140 NaCl, 2.5 KCl, 2.5 CaCl₂, 2 MgCl₂, 1 NaH₂PO₄, 20
glucose, 10 HEPES, pH 7.4; intracellular, 135 CsMeS, 5 CsCl, 10 HEPES, 0.5 EGTA,
1 MgCl₂, 4 Mg₂ATP, 0.4 NaGTP, 5 QX-314, pH 7.4 with CsOH (for EPSCs,
AMPA-, NMDA-, and γ-aminobutyric acid-current measurements) and 125
CsCl, 10 EGTA, 1 MgCl₂, 0.1 CaCl₂, 4 MgATP, 0.3 NaGTP, 10 HEPES, 5 QX-314,
pH 7.2 with CsOH (for IPSC measurements). Recording pipettes made of
borosilicate glass (BF150-110-10, Sutter Instruments) had a resistance of 3-6 MΩ
when filled with intracellular solution. The following combinations of antagonists 
were present in the extracellular solutions to isolate different postsynaptic currents 
(in µM): 20 bicuculline or 50 picrotoxin (spontaneous and evoked EPSCs), 10
NBQX and 50 2-amino-5-phosphonopentanoic acid (APV) (spontaneous and
evoked IPSCs), 10 NBQX and 20 bicuculline or 50 picrotoxin (NMDA-current,
focal application), 50 APV and 50 picrotoxin (AMPA-current, focal application),
10 NBQX and 50 APV (γ-aminobutyric acid-current, focal application). Spontaneous
EPSCs and IPSCs were recorded for 3 min (1 min after break-in to block sodium
current by QX-314) at holding potential −70 mV. Recordings were filtered at
2 kHz and digitized at 4 kHz. Evoked postsynaptic currents were measured after
spontaneous currents and induced by brief (1 ms) unipolar current pulses of
various amplitudes (0.1-0.9 mA, Δ = 0.05-0.1 mA), delivered with a stimulating
electrode (CBAEC75, FHC) positioned 100-150 µm from the cell soma and connected
to an external stimulator (A365, WPI). Recordings were filtered at 2 kHz
and digitized at 10 kHz. NMDA-, AMPA- and γ-aminobutyric acid-currents were
induced by a focal transient application of the following agonists (in µM): 100
NMDA and 10 Gly, 200 AMPA, or 100 γ-aminobutyric acid. An injection pipette
(patch pipette with resistance of 1-2 MΩ when filled with extracellular solution)
was positioned about 20 µm from the cell soma and connected to a pico-spritzer
(PDES-02DX, npi Electronics) with 10 p.s.i. in-line pressure. In all experiments,
access resistance was monitored throughout the experiment (~15 MΩ), but not
compensated. Cells were included for analysis only if the change of resistance was
<25% throughout the experiment.
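
The inclusion criterion on access resistance can be written as a simple check. The sketch below assumes that "change" is measured relative to the first reading after break-in, which the text does not state explicitly; the function name and the example readings are hypothetical.

```python
def passes_access_criterion(access_resistance_mohm, max_fractional_change=0.25):
    """True if access resistance never drifts by 25% or more from its first value."""
    baseline = access_resistance_mohm[0]
    return all(
        abs(value - baseline) / baseline < max_fractional_change
        for value in access_resistance_mohm
    )

stable_cell = [15.0, 15.8, 16.5, 17.9]  # MOhm, hypothetical readings (max drift ~19%)
drifting_cell = [15.0, 16.0, 19.5]      # MOhm, hypothetical readings (max drift 30%)
kept = [passes_access_criterion(cell) for cell in (stable_cell, drifting_cell)]
```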

Data were collected and initially analysed with Patchmaster software (HEKA). 
Further analysis was performed using Clampfit 10 (Molecular Devices), IgorPRO 
(Wavemetrics), Excel (Microsoft) and Prism (GraphPad). The baseline noise 
amplitude in our experiments was about 2.5 pA. Spontaneous EPSCs were detected 
using a template search procedure in Clampfit 10. A template for automatic EPSC 
detection was created by averaging 50 manually picked synaptic events. Spontaneous
IPSCs were detected using MiniAnalysis software. The decay kinetics of
NMDA-EPSCs was estimated using a weighted decay time constant (τdecay),
calculated from the area under the normalized current on an interval from
16.4 ms (3 × τAMPA decay) to 1.1 s after the peak of AMPA-EPSC.
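
A weighted decay time constant of this kind is the area under the current once it has been normalized to unit amplitude; for a mono-exponential decay that area approaches the ordinary time constant. The sketch below is illustrative only. It assumes the trace is normalized to its value at the start of the integration window and uses trapezoidal integration, neither of which is specified in the text, and it is exercised on a synthetic exponential rather than recorded data.

```python
import math

def weighted_decay_tau(times_ms, current, start_ms, end_ms):
    """Area under the window-normalized current, in ms.

    The current is divided by its value at the first sample inside the
    window (an assumption), then integrated with the trapezoidal rule.
    """
    window = [(t, i) for t, i in zip(times_ms, current) if start_ms <= t <= end_ms]
    reference = window[0][1]
    area = 0.0
    for (t0, i0), (t1, i1) in zip(window, window[1:]):
        area += 0.5 * (i0 / reference + i1 / reference) * (t1 - t0)
    return area

# Synthetic inward current: -50 pA amplitude, 100 ms decay, sampled every 0.1 ms.
times = [k * 0.1 for k in range(11001)]                # 0 to 1,100 ms
trace = [-50.0 * math.exp(-t / 100.0) for t in times]
tau = weighted_decay_tau(times, trace, start_ms=16.4, end_ms=1100.0)
```

For this synthetic trace the result is very close to the 100 ms used to generate it, because almost all of the decay falls inside the window.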

Western blot analysis. Cells grown in 2-cm dishes were lysed in a standard RIPA
buffer containing a protease inhibitor cocktail (Roche Diagnostics). Protein concentrations
were measured using a BCA protein assay kit (Thermo). Proteins were
denatured with Laemmli buffer at 70°C for 10 min, separated on a NuPAGE 
4-12% Bis-Tris gel (Novex), and transferred to a polyvinylidene difluoride (PVDF) 
membrane. Membranes were blocked in 5% milk (1h at room temperature) and 
blotted with the following primary antibodies (overnight at 4 °C): guinea pig anti-SHANK3
(1:700, gift from Carlo Sala), mouse anti-GLUN1 (1:1,000, SYSY), rabbit
anti-GLUA1 (1:1,000, Upstate), mouse anti-GLUA2 (1:1,000, NeuroMab), and
mouse anti-GAPDH (MS, 1:1,000, Ambion). To control loading protein amounts,
we first quantified the relative concentrations of GAPDH in all lysates. Afterwards,
15-25 µg of proteins were loaded into a gel and intensities of appropriate bands
were measured and normalized to the intensity of the GAPDH band in the same 
lane. Bands were visualized and quantified using the FluorChemQ System (Alpha 
Innotech). 
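
The per-lane loading normalization amounts to dividing each band intensity by the GAPDH intensity measured in the same lane. A minimal sketch, with hypothetical lane names and intensity values in arbitrary units:

```python
def normalize_to_gapdh(lanes):
    """Map each lane to band intensity divided by GAPDH intensity in that lane."""
    return {lane: band / gapdh for lane, (band, gapdh) in lanes.items()}

# (band intensity, GAPDH intensity) per lane; values are made up for illustration.
lanes = {"control": (1200.0, 600.0), "mutant": (450.0, 500.0)}
relative = normalize_to_gapdh(lanes)
```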

Immunocytochemistry, imaging and synaptic assays. Cells were fixed in 4%
paraformaldehyde (15 min at room temperature), washed (3×) with PBS glycine,
permeabilized with 0.5% Nonidet P40 (7 min at room temperature), washed (3×)
with PBS, and blocked in 3% BSA (overnight at 4 °C). Primary antibodies were
diluted in 3% BSA and applied overnight at 4 °C. The following primary antibodies
were used: chicken anti-GFP (Abcam, 1:1,000), rabbit anti-tRFP (Axxora, 1:1,000),
guinea pig anti-SHANK3 (produced by TM Boeckers, 1:800), mouse anti-SYN1
(SYSY, 1:500), guinea pig anti-HOMER1 (SYSY, 1:200), mouse anti-PSD95 (Abcam,
1:200), rat anti-CTIP2 (Abcam, 1:200), mouse anti-SATB2 (Abcam, 1:10), mouse
anti-GAD67, and anti-nuclei (Millipore, 1:100). Secondary antibodies were diluted
(1:500-1,000) in 3% BSA and applied for 1 h at room temperature. The following 
secondary antibodies were used: Alexa-405, -488, -594 and -647 (Invitrogen). 
Neurons were visualized using either a Zeiss spinning disc confocal microscope,
equipped with 405/488/594/640 lasers (Perkin Elmer) or a laser-scanning confocal
microscope equipped with 405/488/594/633 lasers (Stanford Neuroscience Microscopy
Services), using either Volocity (Improvision) or Zen (Zeiss) software.
Z-stack images of fluorescent cells were taken with a 0.2-0.7 µm interval using
×100 or ×63 objectives that cover an 80 × 80 or 150 × 150 µm² area, respectively.
Images within an experiment were acquired with identical acquisition settings.
Approximately 20 images (10 GFP+ and 10 mKate2+ neurons) were collected
from each coverslip. Visualization and analysis of all images were performed using
Volocity 5.1 and ImageJ. For quantification of total SHANK3 expression, a three-dimensional
mask of a neuron was created based on GFP or mKate2 fluorescence
and the mean SHANK3 intensity within the masked area was calculated. For 
quantification of the number of synapses, Z-stacks were merged using maximum 
intensity readings for each of the channels and a two-dimensional mask of a neuron 
was created based on the GFP or mKate2 fluorescence reading. Presynaptic (SYN1)
and postsynaptic (SHANK3, HOMER1 or PSD-95) puncta within a mask were
found either manually using a cell counter plug-in for ImageJ or automatically using
a custom measurement protocol created in Volocity. During automatic quantification,
once settings for puncta detection were chosen, they were uniformly applied
for all images within an experiment. Results of automatic puncta quantification 
were always validated by visual inspection and manual counting for a subset of the 
images. All cover slips were coded and processed in a blinded manner. To combine 
data acquired from different coverslips in different experiments, we normalized all 
puncta counts to the median number of puncta detected on control neurons for a 
particular cover slip. 
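
The coverslip-wise normalization of puncta counts can be illustrated as below. This is a sketch under stated assumptions: the data layout, the group labels ("control", "mutant") and the counts are hypothetical, and only the idea of dividing by the median count on control neurons of the same coverslip comes from the text.

```python
from statistics import median

def normalize_puncta(counts_by_coverslip):
    """Divide every puncta count by the median control count of its coverslip."""
    normalized = {}
    for coverslip, groups in counts_by_coverslip.items():
        reference = median(groups["control"])
        normalized[coverslip] = {
            group: [count / reference for count in counts]
            for group, counts in groups.items()
        }
    return normalized

# Hypothetical puncta counts per neuron on two coverslips.
counts = {
    "coverslip_1": {"control": [10, 20, 30], "mutant": [5, 10]},
    "coverslip_2": {"control": [40, 50, 60], "mutant": [25, 30]},
}
relative_counts = normalize_puncta(counts)
```

Expressing every neuron relative to the control median of its own coverslip lets counts from coverslips with different overall synapse density be pooled.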

Construction of SHANK3 lentiviral vector. The translated region of the rat
SHANK3 cDNA (NM_021676.1, generously provided by R. Xavier) was isolated
in pieces by PCR and flanked with NdeI (5′) and ClaI (3′) restriction enzyme sites,
and then subcloned into the pSC-B-amp/kan vector using the StrataClone Blunt
PCR Cloning kit (Stratagene). Similarly, a DNA fragment of EGFP flanked with
NotI (3′ UTR), NdeI (5′ UTR before stop codon), and ClaI (3′ UTR) restriction
enzyme sites was obtained by PCR and subcloned into a pSC-B-amp/kan vector.
Afterwards, NotI-EGFP-ClaI (0.7 kb) and NdeI-SHANK3-ClaI (5.2 kb) fragments
were isolated, gel-purified, and sequentially inserted into the NotI and ClaI sites
and then into an NdeI- and ClaI-digested pHAGE-EF1α lentiviral vector backbone
to generate the pHAGE-EF1α-EGFP-SHANK3 (13 kb) vector. The sequence was
confirmed by sequencing.
Lentivirus production, titration and infection. Lentiviruses were generated with
a five-plasmid transfection system using Fugene 6 transfection reagent (Roche) in
293T cells as previously described. At 48 h after transfection, supernatants were
collected at 12-h intervals, 4-5 times in total. Viral particles were concentrated
using Lenti-X concentrator (Clontech), aliquoted (200 µl), and stored at −80 °C.
Different amounts of virus (5-100 µl) were placed on ~80% confluent HEK293T
cells for 3-4 days. Approximately 80% of the cells in the 35-mm plate were fluorescent
upon addition of 100 µl of virus. We used 100 µl of GFP-SHANK3 virus to
transduce human iPS cell-derived neurons (day 25-30) growing in 35-mm culture
plates. The medium was replaced 10-20 h later. Neurons were transferred onto rat
primary astrocytes after 5-10 days.
Drugs and treatment procedure. The following agents were tested: 0.1 µM trichostatin
A (TSA, Tocris), 1 mM valproic acid (VPA, Calbiochem), 3 µM nifedipine
(Nif, Tocris), 20 ng ml⁻¹ insulin growth factor 1 (IGF1, R&D Systems), and
20 ng ml⁻¹ insulin growth factor 2 (IGF2, R&D Systems). Concentrations were
chosen on the basis of previous reports of their use available in the literature.
Effects of TSA, VPA, and Nif were assessed after 24-36 h of treatment and
IGF1 and IGF2 after 2-3 weeks, with approximately 25% of the media changed
every other day.
Data collection and statistics. All experiments were replicated at least three times 
using iPS cell lines indicated in Supplementary Table 3. The sample size and 
description of the sample collection are reported in each figure legend. The number
of samples was selected based on the variability of the particular assay. Samples
were excluded from analysis only when they were clear outliers (identified visually
and confirmed with the Grubbs’ test) or were erroneously generated as a result of a
technical problem. Wells with neurons were randomly selected for an assay/drug
treatment, coded, and processed in a blinded manner. Statistical tests used for
comparison are reported in each figure legend. For data sets where parametric
tests were used for analysis, the normal distribution of data was confirmed using 
the Kolmogorov-Smirnov test. 
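
As an illustration of the outlier check, the Grubbs' statistic is the largest absolute deviation from the sample mean divided by the sample standard deviation, and a value is flagged when the statistic exceeds a critical value that depends on sample size and significance level. The sketch below is not the authors' procedure. The data are hypothetical, and the critical value 2.290 is the standard two-sided value for n = 10 at α = 0.05; other sample sizes need their own value from the Grubbs formula or a published table.

```python
from statistics import mean, stdev

def grubbs_statistic(values):
    """Return (index of the most extreme value, Grubbs' G for that value)."""
    centre = mean(values)
    spread = stdev(values)  # sample standard deviation (n - 1 denominator)
    deviations = [abs(v - centre) for v in values]
    suspect = max(range(len(values)), key=deviations.__getitem__)
    return suspect, deviations[suspect] / spread

G_CRIT_N10 = 2.290  # two-sided critical value for n = 10, alpha = 0.05

# Hypothetical measurements; the last value of the first set looks suspicious.
with_outlier = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3, 10.0, 14.0]
without_outlier = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3, 10.0, 10.1]

index_a, g_a = grubbs_statistic(with_outlier)
index_b, g_b = grubbs_statistic(without_outlier)
flag_a = g_a > G_CRIT_N10
flag_b = g_b > G_CRIT_N10
```

The test is designed to examine one suspect value at a time, which matches the described workflow of identifying a candidate visually and then confirming it statistically.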

Paneth cells as a site of origin for intestinal inflammation

The recognition of autophagy related 16-like 1 (ATG16L1) as a genetic
risk factor has exposed the critical role of autophagy in Crohn’s
disease. Homozygosity for the highly prevalent ATG16L1 risk allele,
or murine hypomorphic (HM) activity, causes Paneth cell dysfunction.
As Atg16l1HM mice do not develop spontaneous intestinal inflammation,
the mechanism(s) by which ATG16L1 contributes to disease
remains obscure. Deletion of the unfolded protein response (UPR)
transcription factor X-box binding protein-1 (Xbp1) in intestinal
epithelial cells, the human orthologue of which harbours rare inflammatory
bowel disease risk variants, results in endoplasmic reticulum
(ER) stress, Paneth cell impairment and spontaneous enteritis.
Unresolved ER stress is a common feature of inflammatory bowel
disease epithelium, and several genetic risk factors of Crohn’s disease
affect Paneth cells. Here we show that impairment in either UPR
(Xbp1ΔIEC) or autophagy function (Atg16l1ΔIEC or Atg7ΔIEC) in intestinal
epithelial cells results in each other’s compensatory engagement,
and severe spontaneous Crohn’s-disease-like transmural ileitis
if both mechanisms are compromised. Xbp1ΔIEC mice show autophagosome
formation in hypomorphic Paneth cells, which is linked to
ER stress via protein kinase RNA-like endoplasmic reticulum kinase
(PERK), elongation initiation factor 2α (eIF2α) and activating transcription
factor 4 (ATF4). Ileitis is dependent on commensal microbiota
and derives from increased intestinal epithelial cell death, inositol
requiring enzyme 1α (IRE1α)-regulated NF-κB activation and tumour-necrosis
factor signalling, which are synergistically increased when
autophagy is deficient. ATG16L1 restrains IRE1α activity, and augmentation
of autophagy in intestinal epithelial cells ameliorates ER
stress-induced intestinal inflammation and eases NF-κB overactivation
and intestinal epithelial cell death. ER stress, autophagy induction
and spontaneous ileitis emerge from Paneth-cell-specific deletion
of Xbp1. Genetically and environmentally controlled UPR function
within Paneth cells may therefore set the threshold for the development
of intestinal inflammation upon hypomorphic ATG16L1 function and
implicate ileal Crohn’s disease as a specific disorder of Paneth cells.

The UPR and autophagy are integrally linked pathways. To investigate
their relationship in the intestinal epithelium, we stably transduced
the small intestinal epithelial cell line MODE-K with a short hairpin
Xbp1 (shXbp1) lentiviral vector (Extended Data Fig. 1a). We observed
increased levels of ATG7, beclin 1 (Extended Data Fig. 1b) and phosphatidylethanolamine
(PE)-conjugated microtubule-associated protein
1 light chain 3-B (LC3-II) relative to LC3-I compared to control-silenced
(shCtrl) cells (Extended Data Fig. 1c). Increased autophagic flux accounted
for this, given increased levels of LC3-II relative to LC3-I observed after
inhibition of autophagosome-lysosome fusion by bafilomycin (Extended
Data Fig. 1d). Increased GFP-LC3 punctae were seen in shXbp1 compared
to shCtrl MODE-K cells transfected with a green fluorescent
protein (GFP)-LC3-expressing vector (Extended Data Fig. 1e, f), as
well as increased numbers of autophagic vacuoles in shXbp1 compared
to shCtrl cells (Extended Data Fig. 1g, h). Accordingly, isolated primary
intestinal epithelial cells from the small intestine of villin (V)-cre+;
Xbp1fl/fl (hereafter called Xbp1ΔIEC) mice backcrossed onto C57BL/6
(B6) background exhibited nearly complete consumption of LC3-I and a
relative increase in LC3-II (Fig. 1a and Extended Data Fig. 1i), stable
amounts of ATG7 (presumably reflecting a combination of increased
production and consumption), elevated levels of ATG16L1 and beclin 1
(Fig. 1b and Extended Data Fig. 1j), and autophagosomes and degradative
autophagic vacuoles consistent with autophagy induction in hypomorphic
Paneth cells, and to a lesser extent goblet cells (data not shown), compared
to V-cre−;Xbp1fl/fl (hereafter called wild type) mice (Fig. 1c and
Extended Data Fig. 1k). To gain temporal control of Xbp1 deletion and
the ability to monitor autophagy in situ and exclude the role of chronic
inflammation in this induction, we generated V-creERT2;Xbp1fl/fl
(hereafter called Xbp1T-ΔIEC) mice on a B6 background crossed to GFP-LC3
transgenic mice. Three days after tamoxifen-induced Xbp1 deletion
(Extended Data Fig. 2a), although mature Paneth cells remained
present with little detectable inflammation (data not shown), punctate
GFP signal accumulation was greatest at the bottom of the crypts of
Lieberkühn (Fig. 1d, e), and co-localized with lysozyme-positive Paneth
cells (Extended Data Fig. 2b). Purified crypts of Xbp1T-ΔIEC mice
revealed increased LC3-I/II conversion and reduced p62 compared
to wild-type mice (Extended Data Fig. 2c). Thus, Xbp1 loss in intestinal
epithelial cells induced autophagy most notably in Paneth cells.
Although Xbp1-deficient intestinal epithelial cells exhibited broad
evidence of ER stress (Extended Data Fig. 2d-j), an examination of
shXbp1, relative to shCtrl, MODE-K cells demonstrated a particularly
significant increase in phosphorylated (p)-PERK and its substrate p-eIF2α
(Fig. 1f), with the latter reversed by Perk (also called Eif2ak3) silencing,
identifying it as the factor responsible for p-eIF2α formation (Extended
Data Fig. 3a). Increased p-eIF2α was also detected in primary intestinal
epithelial cells of Xbp1ΔIEC (Fig. 1g and Extended Data Fig. 3b) and
Xbp1T-ΔIEC mice (Extended Data Fig. 3c). Consistent with the involvement
of PERK-eIF2α in autophagy induction, ATF4, a transcriptional effector
of this pathway, and its transcriptional target, C/EBP-homologous
protein (CHOP; encoded by Ddit3), were increased in primary intestinal
epithelial cells of Xbp1ΔIEC mice (Fig. 1g and Extended Data Fig. 3b), and
chromatin-immunoprecipitation (ChIP) with anti-ATF4 demonstrated
increased binding to the Map1lc3b (LC3b) (Fig. 1h) and Atg7 (Fig. 1i)
promoters, both of which contain ATF4 binding sites, in shXbp1 relative
to shCtrl MODE-K cells. ATG7 is essential for the formation of the
ATG12-ATG5 conjugate during autophagy. shXbp1 MODE-K cells
showed increased LC3b and Atg7 expression compared to shCtrl MODE-K
cells (Extended Data Fig. 3d), and Perk co-silencing abrogated ATG7
induction observed in shXbp1 compared to shCtrl MODE-K cells

Figure 1 | PERK-eIF2α signalling induces autophagy in Xbp1-deficient
intestinal epithelial cells. a, b, Immunoblot for LC3 conversion in isolated
primary intestinal epithelial cells (a) (n = 5/4) and for autophagy proteins in
primary intestinal epithelial cell scrapings (b) (n = 3). WT, wild type.
c, Transmission electron microscopy (TEM) of crypts. Note autophagic
vacuoles in various stages of evolution in Xbp1ΔIEC hypomorphic Paneth cells.
d, e, Crypt showing GFP-LC3 punctae (d), quantified in e (n = 10; unpaired
Student’s t-test; mean ± s.e.m.). Scale bar, 5 µm. f, g, Immunoblot of silenced
MODE-K cells (f) and primary intestinal epithelial cell scrapings (g) for the
PERK-eIF2α branch (n = 3). h, i, Promoter sequence qPCR for Map1lc3b
(LC3b) (h) and Atg7 (i) after anti-ATF4 ChIP (unpaired Student’s t-test;
mean ± s.e.m.). j, GFP-LC3 punctae per crypt after treatment with tamoxifen,
and vehicle or salubrinal (n = 10; one-way ANOVA with post-hoc Bonferroni;
mean ± s.e.m.). k, Enteritis histology score after salubrinal and tamoxifen
co-treatment (n = 12/14/13; median shown; Kruskal-Wallis with post-hoc
Holm’s-corrected Mann-Whitney U-test). Results represent three (a, f, g) or
two (c, e, h, i) independent experiments. *P < 0.05, ***P < 0.001.


(Extended Data Fig. 3a). Salubrinal, a selective inhibitor of eIF2α
dephosphorylation (Extended Data Fig. 2a), increased the accumulation
of GFP-LC3 punctae primarily in Paneth cells, in both Xbp1-sufficient
and -deficient intestinal epithelial cells (Fig. 1j and Extended Data
Fig. 2b and 3e), and provoked increased levels of LC3-II relative to
LC3-I (Extended Data Fig. 3f) and CHOP (Extended Data Fig. 3f) in
Xbp1T-ΔIEC mice, along with, importantly, an amelioration of the acute
enteritis (Fig. 1k and Extended Data Fig. 3g). Similarly, silencing of
growth arrest and DNA-damage-inducible protein 34 (Gadd34; also


Figure 2 | Impairment of ER stress-induced compensatory autophagy results in
severe transmural inflammation. a, TEM of crypts of Lieberkühn (n = 2). Scale
bars, 2 µm. b, Representative haematoxylin and eosin stainings. Note transmural
inflammation extending through muscularis propria (white arrow) into serosa
(black arrow) in Atg7/Xbp1ΔIEC and Atg16l1/Xbp1ΔIEC mice scored in c and e.
Scale bars, 50 µm, except for lower two panels (10 µm). c, Enteritis histology
score (n = 26/12/18/27; 10-18 weeks; median shown; Kruskal-Wallis with post-hoc
Holm’s-corrected Mann-Whitney U-test). d, Crypt TEM in indicated genotypes
(n = 2). Scale bars, 2 µm. e, Enteritis histology score (n = 11; 18 weeks; median
shown; Kruskal-Wallis with post-hoc Holm’s-corrected Mann-Whitney U-test). NS,
not significant. f, Xbp1 mRNA splicing (Xbp1u, unspliced; Xbp1s, spliced) of
crypts; densitometry in g (n = 7; unpaired Student’s t-test; mean ± s.e.m.).
h, GRP78 (green) immunofluorescence, white arrows indicate GRP78+ crypts (DAPI,
blue; n = 5). Scale bars, 10 µm. *P < 0.05, **P < 0.01, ***P < 0.001.

called Ppp1r15a), part of the protein phosphatase 1 complex that dephosphorylates
eIF2α, led to increased eIF2α phosphorylation and ATG7
expression in shXbp1 compared to shCtrl MODE-K cells (Extended
Data Fig. 3h, i). Xbp1ΔIEC;Gadd34−/− mice with hypomorphic GADD34
function exhibited increased p-eIF2α and ATG7 in purified crypt epithelial
cells compared to Xbp1ΔIEC;Gadd34+/+ mice (Extended Data Fig. 3j).
Thus, PERK-p-eIF2α is a critical mediator of UPR-induced autophagy
primarily in Paneth cells consequent to XBP1 deficiency.

These studies led us to propose that autophagy may function as a compensatory
mechanism in intestinal epithelial cells upon sustained ER stress.
We therefore generated V-cre+;Atg7fl/fl;Xbp1fl/fl (Atg7/Xbp1ΔIEC) mice. Intestinal
epithelial cells from Atg7ΔIEC mice lacked LC3-II formation and the
ATG12-ATG5-ATG16L1 complex (Extended Data Fig. 4a). Atg7/Xbp1ΔIEC
mice demonstrated a complete absence of UPR-induced autophagy (Fig. 2a
and Extended Data Fig. 4a-c), and a remarkable worsening of ileitis compared
to Xbp1ΔIEC mice. In notable contrast to Xbp1ΔIEC mice, where inflammation
was limited to the mucosa, >70% of Atg7/Xbp1ΔIEC mice developed
discontinuous submucosal or transmural inflammation, characterized by
acute and chronic inflammation extending in an abrupt knife-like fashion
to muscularis propria and serosa, closely resembling the early
fissuring ulcerations and fistulous tracts observed in human Crohn’s
disease (Fig. 2b, c and Extended Data Fig. 4d). In contrast to Xbp1ΔIEC
mice, enteritis in Atg7/Xbp1ΔIEC mice progressed over the 18-week
observation period such that at this time point all animals exhibited
submucosal or transmural disease (Extended Data Fig. 4e, f).

ATG16L1 is a major genetic risk factor for Crohn’s disease, especially
ileal Crohn’s disease. Complex formation of ATG16L1 protein
with ATG12-ATG5 defines the site of LC3 PE conjugation during autophagosome
formation. ATG16L1 protein expression was markedly
increased in Xbp1ΔIEC compared to wild-type primary intestinal epithelial
cells (Fig. 1b and Extended Data Fig. 1j), presumably consequent
to PERK/eIF2α/ATF4-dependent transactivation of Atg7 and LC3b promoters
and stabilization by the ATG7-induced ATG12-ATG5 complex.
We therefore developed mice with a floxed Atg16l1 allele that would allow
for intestinal-epithelial-cell-specific deletion via V-cre (Atg16l1ΔIEC; Extended
Data Fig. 4g-i). Paneth cells in Atg16l1ΔIEC mice demonstrated a reduction
in their overall size and number of granules, similar to gene-trap-targeted
Atg16l1HM mice (Extended Data Fig. 4j-n). Intestinal epithelial
cells from Atg16l1ΔIEC mice, compared to wild-type mice, exhibited reduced
expression of ATG7 and the ATG12-ATG5 conjugate (Extended Data Fig.
5a), along with disruption of the secretory pathway with a distended ER,
reduced size and number of secretory granules, a loss of homeostatic autophagy
(Fig. 2d and Extended Data Fig. 5b, c) and increased p62 immunoreactivity
in crypts (Extended Data Fig. 5d). To address the role of ATG16L1
under ER stress conditions, we generated V-cre+;Atg16l1fl/fl;Xbp1fl/fl
(Atg16l1/Xbp1ΔIEC) mice. Atg16l1/Xbp1ΔIEC mice, which lacked
UPR-induced autophagy (Extended Data Fig. 5b, c), developed severe
spontaneous ileitis compared to Xbp1ΔIEC or Atg16l1ΔIEC mice, with
discontinuous submucosal or transmural inflammation in >70% of
18-week-old animals (Fig. 2b, e and Extended Data Fig. 5e, f) with
features similar to those observed in Atg7/Xbp1ΔIEC mice, and present
in two distinct animal facilities (Fig. 2b, e and Extended Data Fig. 5g).
This phenotype highlights the important compensatory role played by
autophagy and in particular ATG16L1 in defending against inflammation
arising from unabated ER stress precisely in the small intestinal
epithelium as a consequence of XBP1 deficiency.

ATG7 hypofunction in hepatocytes can induce ER stress, raising
the possibility that cross-talk between the UPR and autophagy may be 

Figure 3 | Autophagy restrains IRE1α-mediated NF-κB activation in
Xbp1-deficient epithelium. a, shCtrl or shXbp1 MODE-K cells were
co-silenced for Atg16l1 (siAtg16l1) or with scrambled siRNA (siCtrl), and analysed by
flow cytometry for annexin V and propidium iodide (PI; n = 3; one-way ANOVA
with post-hoc Bonferroni; mean ± s.e.m.). b, Immunoblot of primary intestinal
epithelial cell scrapings (n = 3). c, Immunoblot of cytoplasmic extracts from shCtrl or
shXbp1 MODE-K cells after TNF stimulation. d, TUNEL+ intestinal epithelial cells
(IECs) per 100 crypts after BAY11-7082 or vehicle treatment (n = 3/4/4; one-way
ANOVA with post-hoc Holm’s-corrected unpaired Student’s t-test; mean ± s.e.m.).
e, Enteritis histology score of mice treated with BAY11-7082 or vehicle (n = 10/10/9;
median shown; Kruskal-Wallis with post-hoc Holm’s-corrected
Mann-Whitney U-test). f, Immunoblot of intestinal epithelial cell scrapings for
(p-)IRE1α after IRE1α immunoprecipitation (IP). β-Actin, loading control of
whole lysates. g, Immunoblot of primary intestinal epithelial cell scrapings
(n = 4). h, i, Enteritis histology score of indicated genotypes (h, n = 15/16/14/15;
i, n = 5/10/12; median shown; Kruskal-Wallis with post-hoc Holm’s-corrected
Mann-Whitney U-test). j, Enteritis histology score of specific pathogen-free
(SPF) and germ free (GF) housed mice (n = 10/9/7/7; median shown; Kruskal-Wallis
with post-hoc Holm’s-corrected Mann-Whitney U-test).
k, Representative images of p-IκBα immunoreactivity under conditions as in
j (n = 4). Scale bars, 20 µm. Results represent four (f), three (c) or two
(a, b) independent experiments. *P < 0.05, **P < 0.01, ***P < 0.001.

bi-directional in intestinal epithelial cells. Isolated Atg16l1ΔIEC crypts
exhibited increased Xbp1 splicing compared to wild type (Fig. 2f, g) and
increased Grp78 (also called Hspa5) expression (Extended Data Fig. 5h)
localized to the crypt bottom (Fig. 2h). Dextran sodium sulphate, a
colitis model involving ER stress in intestinal epithelial cells that can
be treated with ER stress-relieving chaperones, induced more inflammation
in Atg16l1ΔIEC compared to wild-type mice (Extended Data Fig.
5i-l), similar to Atg16l1HM mice. Thus, disturbances in autophagy within
intestinal epithelial cells also affect the UPR, and autophagy-associated
factors such as ATG16L1 endow intestinal epithelial cells with the ability
to mitigate ER stressors that are commonplace at the mucosal surface.
We next turned our attention to mechanisms by which autophagy
counteracts ER stress and synergistically increases intestinal inflammation
when absent. Increased numbers of TdT-mediated dUTP nick
end labelling (TUNEL)+ cells in Atg16l1/Xbp1ΔIEC and Atg7/Xbp1ΔIEC
mice (Extended Data Fig. 6a, b) correlated with enteritis severity in
double-mutant, in contrast to Xbp1ΔIEC mice (Extended Data Fig. 6b),
and, as demonstrated for Atg16l1/Xbp1ΔIEC mice, concomitantly with
increasing age (Extended Data Fig. 6c and Supplementary Video 1).
Silencing of Atg16l1 in shXbp1 MODE-K cells significantly increased
the proportion of apoptotic cells in vitro (Fig. 3a and Extended Data
Fig. 6d), indicating that increased apoptosis could function as an initial
event in intestinal inflammation. Furthermore, intestinal-epithelial-cell-associated
NF-κB activation, a critical player in intestinal inflammation,
was absent in primary intestinal epithelial cells from wild-type mice and
gradually increased in Atg16l1ΔIEC, Xbp1ΔIEC and Atg16l1/Xbp1ΔIEC mice
(Fig. 3b and Extended Data Fig. 6e, f). shXbp1 MODE-K cells exhibited
increased NF-κB activation when stimulated with tumour-necrosis
factor (TNF) (Fig. 3c and Extended Data Fig. 6g, h) or Toll-like receptor
ligands (data not shown) relative to shCtrl MODE-K cells, demonstrating
increased sensitivity of XBP1-deficient intestinal epithelial cells to
inflammatory and environmental stimuli. Inhibition of NF-κB by treatment
with BAY11-7082 decreased intestinal epithelial cell death in
Xbp1T-ΔIEC mice (Fig. 3d and Extended Data Fig. 6i) and protected
from enteritis in both Xbp1T-ΔIEC and Xbp1ΔIEC mice (Fig. 3e and Extended
Data Fig. 6j) relative to vehicle-treated mice. A progressive increase in
intestinal epithelial cells of total and phosphorylated IRE1α (encoded
by Ern1), the sensor of ER stress upstream of XBP1 and known to control
NF-κB, was observed in Atg16l1ΔIEC, Xbp1ΔIEC and Atg16l1/Xbp1ΔIEC
mice in comparison to wild-type mice (Fig. 3f) and mirrored the escalating
elevations in NF-κB (Fig. 3b and Extended Data Fig. 6e, f). Indeed,
increased NF-κB activity and ileitis were governed by IRE1α, as the
increased epithelial NF-κB phosphorylation observed in Xbp1ΔIEC mice
was abrogated in Ern1/Xbp1ΔIEC mice (Fig. 3g), Ern1 co-silencing of
shXbp1 MODE-K cells abolished the increased expression of the NF-κB
target gene Nfkbia in response to TNF stimulation relative to shCtrl
MODE-K cells (Extended Data Fig. 6k), and enteritis was diminished
in Ern1/Xbp1ΔIEC compared to Xbp1ΔIEC mice (Fig. 3h). Enteritis was
also reversed by germline deletion of Tnfrsf1a in Xbp1ΔIEC mice (Fig. 3i),
and by re-derivation of Xbp1ΔIEC mice into a germ-free environment
(Fig. 3j), which was associated with reduced p-IκBα immunoreactivity
in Xbp1-deficient intestinal epithelial cells compared to mice housed
under specific pathogen-free (SPF) conditions (Fig. 3k). These studies
together demonstrate that enteritis in this model is driven by TNF (the
cytokine targeted by the most potent therapeutics of human Crohn’s
disease) and microbes in a pathway that derives from intestinal epithelial
cell death and IRE1α-dependent activation of NF-κB, with
ATG16L1-dependent autophagy serving to restrain the inflammatory
nature of the latter, probably through the removal of IRE1α.
Accordingly, treatment of Xbp1T-ΔIEC mice with the mTOR inhibitor
rapamycin induced autophagy primarily in crypts (Extended Data
Fig. 7a, b), diminished intestinal-epithelial-cell-associated NF-κB activation
and number of TUNEL+ intestinal epithelial cells in Xbp1ΔIEC
mice (Fig. 4a and Extended Data Fig. 7c-e), and markedly reduced the
severity of enteritis (Fig. 4b). These beneficial consequences of rapamycin
in the setting of unabated ER stress in intestinal epithelial cells were not

Figure 4 | ER stress-induced enteritis originates from Paneth cells and is
alleviated through autophagy induction. a, Immunoblot of primary intestinal
epithelial cell scrapings from mice treated with or without rapamycin for 14
consecutive days (n = 3). b, Enteritis histology score for experiment as in
a (n = 4; median shown; Mann-Whitney U-test). c, Representative images of
EYFP-Rosa26/D6-cre+ reporter mice and EYFP-Rosa26 (controls).
Co-localization of Defa6 Cre-driven EYFP expression (yellow) with
lysozyme-expressing Paneth cells (red; n = 3). DAPI, blue; scale bars, 50 µm.
d, Immunoblots of crypt intestinal epithelial cells from Xbp1ΔPC and wild-type
controls (n = 2). e, f, Representative confocal images of lysozyme (green)
expressing Paneth cells (e) with quantification of crypts with indicated number
of lysozyme+ granulated dots in f (n = 5; unpaired Student’s t-test;
mean ± s.e.m.). DAPI, blue; scale bars, 10 µm. g, Immunohistochemistry for
p-eIF2α (n = 3). Scale bars, 20 µm. h, Representative haematoxylin and eosin
images of Xbp1ΔPC and wild-type mice scored in i. Scale bars, 50 µm. i, Enteritis
scoring in Xbp1fl/fl (WT), Defa6-cre+ (WT) and Defa6-cre+;Xbp1fl/fl
(Xbp1ΔPC) mice (n = 21/26/29; median shown; Kruskal-Wallis with post-hoc
Holm’s-corrected Mann-Whitney U-test). Results represent two
(b, d) independent experiments. *P < 0.05, **P < 0.01, ***P < 0.001.


observed in Atg7/Xbp1“'"© and Atgl6l1/Xbp1“""© mice (Extended 
Data Fig. 7e-i), demonstrating that these effects required intact autop- 
hagy within intestinal epithelial cells. 

The spatial convergence of the consequences of hypomorphic UPR 
and autophagy function in Paneth cells prompted us to test the hypo- 
thesis that these pathways were interdependent in this cell type. We 
developed a defensin 6 alpha promoter-driven cre transgenic line (D6-cre),
and confirmed the exclusive activity of Cre recombinase in lysozyme+
Paneth cells (Fig. 4c). Paneth-cell-specific deletion of Xbp1 in
D6-cre;Xbp1^fl/fl (Xbp1^ΔPC) mice resulted in autophagy activation (Fig. 4d and
Extended Data Fig. 8a, b) and structural defects in granule morphology in
Paneth cells (Fig. 4e, f). Crypts of Xbp1^ΔPC compared to wild-type mice
exhibited increased GRP78 expression (Fig. 4d and Extended Data Fig. 8b)
and p-eIF2α immunoreactivity (Fig. 4g), accompanied by increased conversion
of LC3-I/II and reduced p62 levels (Fig. 4d and Extended Data
Fig. 8b), demonstrating ER stress and autophagy induction. Notably,
75% of Xbp1^ΔPC mice developed spontaneous enteritis (Fig. 4h, i) that
shared the histological inflammatory characteristics observed in Xbp1^ΔIEC
mice (Fig. 2b). Xbp1^ΔPC mice exhibited increased cell death in crypts (Extended
Data Fig. 9a, b) and intestinal epithelial cell turnover (Extended
Data Fig. 9c-f), whereas goblet cells remained unaffected (Extended Data
Fig. 9g, h). Notably, Atg7/Xbp1^ΔPC mice exhibited transmural disease as
early as 8 weeks of age (Extended Data Fig. 9i). We conclude that deletion
of Xbp1 specifically in Paneth cells results in unresolved ER stress, UPR 
activation and induction of autophagy, which can serve as a nidus for the 
emergence of spontaneous intestinal inflammation that evolves into trans- 
mural disease in the absence of the compensation provided by autophagy. 
Our studies thus support a mechanistic model (Extended Data Fig. 
10a-c) for how ATG16L1-associated genetic risk may convert into a 
disease phenotype. The competence of the UPR probably sets the 
threshold for susceptibility of the host with hypofunctional autophagy 
to interacting genetic and environmental factors capable of inducing 
inflammation. Consistent with our model, patients with Crohn’s disease
carrying the ATG16L1^T300A risk variant, which impairs autophagosome
formation, frequently exhibit ER stress in their Paneth cells,
in contrast to those harbouring the normal variant. Finally, our studies
also unequivocally establish that these inflammatory susceptibilities can 
emerge directly from highly secretory Paneth cells and suggest that small 
intestinal Crohn’s disease may be a specific disorder of this cell type. 


METHODS SUMMARY 

Villin (V)-cre+;Xbp1^fl/fl (Xbp1^ΔIEC), V-creER^T2+;Xbp1^fl/fl (Xbp1^T-ΔIEC),
Atg7^fl/fl (ref. 14), Atg16l1^fl/fl (generated as detailed in Methods) and
Gadd34^−/− mice, all backcrossed >8 generations onto B6, and GFP-LC3 (ref. 12),
Ern1^fl/fl and Tnfrsf1a^−/− mice were used
to generate experimental mouse strains. Experiments were performed with sex-
and age-matched animals using littermates where technically possible. Generation
of Paneth-cell-specific Defa6-cre;Xbp1^fl/fl (Xbp1^ΔPC) mice is described in Methods.
Murine norovirus (MNV) status is reported in Extended Data Fig. 9j. Stable clones
of the Xbp1-silenced (shXbp1) small intestinal epithelial cell line MODE-K were
generated via a lentiviral shRNA vector and selected by hygromycin. The sources
of antibodies and reagents are listed in the Methods. Immunoblot, chromatin
immunoprecipitation, RT-PCR, histological analysis, transmission electron
microscopy and confocal microscopy techniques are described in Methods. Statistical
significance was calculated using an unpaired two-tailed Student’s t-test or a
Mann-Whitney U-test and considered significant at P < 0.05. In experiments where more
than two groups were compared, a Kruskal-Wallis test followed by Mann-Whitney
U-test and post-hoc Bonferroni-Holm correction or one-way ANOVA/Bonferroni was
performed as appropriate. Data were analysed using GraphPad Prism software.
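
The Holm step-down correction named above can be illustrated with a short sketch. This is not the authors' analysis (which was run in GraphPad Prism); the function name `holm_adjust` and the example P values are hypothetical and serve only to show how the adjustment is applied to a family of pairwise comparisons.

```python
def holm_adjust(p_values):
    """Return Holm step-down adjusted P values, in the original order.

    The smallest raw P value is multiplied by m, the next by m - 1, and so
    on; adjusted values are capped at 1 and forced to be non-decreasing.
    """
    m = len(p_values)
    # Indices of the raw P values, from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


# Hypothetical raw P values from three pairwise Mann-Whitney U-tests.
raw_p = [0.004, 0.03, 0.02]
adjusted_p = holm_adjust(raw_p)
significant = [p < 0.05 for p in adjusted_p]
print(adjusted_p, significant)
```

Holm's procedure controls the family-wise error rate like the plain Bonferroni correction but is never less powerful, because only the smallest P value receives the full multiplier.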


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
Mice. Xbp1^fl/fl mice were backcrossed >8 generations onto B6 and crossed with
V-cre-ER^T2 (B6) mice provided by N. Davidson and S. Robine to generate mice
with tamoxifen-inducible Xbp1 deletion in the intestinal epithelium (Xbp1^T-ΔIEC),
and with V-cre (B6) mice to generate mice with constitutively active Xbp1 deletion
in the intestinal epithelium (Xbp1^ΔIEC). V-cre-ER^T2 recombinase was activated by
3 or 5 daily intraperitoneal administrations of 1 mg tamoxifen (MP Biomedicals)
as indicated in figure legends. V-cre+;Xbp1^fl/fl (Xbp1^ΔIEC) mice were crossed with
Atg7^fl/fl mice provided by M. Komatsu to obtain V-cre+;Atg7^fl/fl;Xbp1^fl/fl
(Atg7/Xbp1^ΔIEC) mice. Mice with a floxed Atg16l1 allele were generated in collaboration
with GenOway. Briefly, a proximal loxP site was introduced within the promoter
region of the Atg16l1 gene upstream of exon 1, and a distal loxP site was introduced with
an FRT-flanked neomycin selection cassette within intron 1. The resultant mouse
line was bred with deleter mice constitutively expressing Flp recombinase to remove
the neomycin selection cassette, creating an Atg16l1^fl/fl mouse in which Atg16l1 exon
1 was flanked by two loxP sites (Extended Data Fig. 4g). After backcrossing onto B6,
these mice were crossed with V-cre+ mice, resulting in V-cre;Atg16l1^fl/fl mice with
intestinal-epithelial-cell-specific Atg16l1 deletion (Atg16l1^ΔIEC). Atg16l1^fl/fl mice
were crossed with Xbp1^ΔIEC mice to develop V-cre+;Atg16l1^fl/fl;Xbp1^fl/fl
(Atg16l1/Xbp1^ΔIEC) mice. GFP-LC3 transgenic mice (gift of N. Mizushima) were crossed
with V-cre-ER^T2;Xbp1^fl/fl mice to generate GFP-LC3;V-cre-ER^T2;Xbp1^fl/fl mice
(GFP-LC3;Xbp1^T-ΔIEC). For the generation of Paneth-cell-specific Defa6-cre;Xbp1^fl/fl
mice, a 1.1-kb cDNA fragment encoding improved Cre (iCre) recombinase
was subcloned downstream of nucleotides −6,500 to +34 of mouse cryptdin-2
gene (Defa6) in the BamHI site of the pCR2-TAg-hGH plasmid to replace the
DNA fragment containing the simian virus 40 large antigen (SV40). A linearized
10.2-kb fragment containing the Defa6 promoter, iCre and hGH (Defa6-iCre-hGH)
was removed by EcoRI digestion, agarose gel-electrophoresed and purified
with the Qiaex Gel Extraction kit (Qiagen), and used for pronuclear injection of B6
mice. Six founders from 22 live-born mice were identified by screening tail DNA
using iCre-specific primers, and two lines were further characterized and crossed
to Xbp1^fl/fl, Atg7^fl/fl and EYFP-Rosa26 reporter mice, provided by K. Rajewsky.
Gadd34^−/− mice were crossed with V-cre;Xbp1^fl/fl mice to generate
V-cre;Xbp1^fl/fl;Gadd34^+/− (Xbp1^ΔIEC/Gadd34^+/−) mice. Ern1^fl/fl (ref. 36) and
Tnfrsf1a^−/− (Jackson) mice were crossed with V-cre;Xbp1^fl/fl mice to generate
V-cre;Ern1^fl/fl;Xbp1^fl/fl (Ern1/Xbp1^ΔIEC) and V-cre;Xbp1^fl/fl;Tnfrsf1a^−/−
(Xbp1^ΔIEC/Tnfrsf1a^−/−) mice, respectively.
Cre transgenes were maintained in the hemizygous state in all experimental strains 
with a floxed allele to generate littermate controls. Xbp1^ΔIEC mice were re-derived in
a germ-free environment and housed in sterile isolators at the Taconic Farms
breeding facility. Tail or ear biopsy genomic DNA was used for genotyping of respective
mouse strains as described previously. Primer sequences are available on request.
Mice were housed in specific pathogen free (SPF) barrier facilities at Harvard
Medical School (Xbp1^ΔIEC, Xbp1^T-ΔIEC, Atg7^ΔIEC, Atg7/Xbp1^ΔIEC, Atg7/Xbp1^ΔPC,
Xbp1^ΔPC, GFP-LC3;Xbp1^T-ΔIEC, Ern1/Xbp1^ΔIEC mice and their respective controls),
University of Cambridge (Xbp1^ΔIEC, Xbp1^T-ΔIEC, Atg16l1^ΔIEC, Atg16l1/Xbp1^ΔIEC,
Xbp1^ΔIEC/Gadd34^+/−, Ern1/Xbp1^ΔIEC mice and their respective controls), Innsbruck
Medical University (Xbp1^ΔIEC, Xbp1^T-ΔIEC, Atg16l1^ΔIEC, Atg16l1/Xbp1^ΔIEC,
Xbp1^ΔIEC/Tnfrsf1a^−/− mice and their respective controls), and
Christian-Albrechts-Universität zu Kiel (Atg16l1^ΔIEC mice and their respective
controls). Colonies maintained at
Boston and Innsbruck were murine norovirus (MNV) positive by Taqman qRT-PCR
(Extended Data Fig. 9j). Xbp1^ΔIEC, Atg16l1^ΔIEC, Ern1^fl/fl and their associated
double-mutant strains were re-derived from the Innsbruck colony into the MNV-free
enhanced barrier Cambridge facility, and colonies were confirmed MNV-negative
by PCR (Extended Data Fig. 9j) and serology (data not shown), as were Atg16l1^ΔIEC
mice held at the Kiel facility. MNV Taqman qRT-PCR was performed as described.
The phenotypes of single- and double-mutant colonies that had been re-derived
from the MNV+ Innsbruck facility into the MNV− Cambridge facility were
indistinguishable, in particular relating to qualitative and quantitative measures of
enteritis and the reciprocal induction of autophagy and ER stress. Mice were 
handled and all experiments performed in accordance with institutional guidelines 
and with the approval of the relevant authorities. 8-10-week-old mice were used 
for all experiments unless stated otherwise in the figure legend, and were randomly 
allocated into treatment groups. 

Antibodies and reagents. The following antibodies and reagents were used for 
immunoblotting. Sigma Aldrich: anti-LC3B (L7543); Cell Signaling Technology:
anti-β-actin (4970; 13E5), anti-GAPDH (2118; 14C10), anti-eIF2α (9722),
anti-phospho-eIF2α (3597; 119A11), anti-PERK (3192; C33E10), anti-phospho-PERK
(3179; 16F8), anti-JNK (9252), anti-phospho-JNK (4668; 81E11), anti-ATG5 (8540;
D1G9), anti-beclin 1 (3495; D40C5), anti-ATG7 (8558; D12B11), anti-CHOP (5554;
D46F1), anti-ATG12 (4180; D88H11), anti-p62 (5114), anti-IKK1 (2682),
anti-IKK2 (2370; 2C8), anti-phospho-IKK1/2 (2697; 16A6), anti-IRE1α (3294; 14C10),
anti-phospho-NF-κB p65 (3033; 93H1), anti-NF-κB p65 (4764; C22B4) and
anti-rabbit/mouse HRP antibodies (7074, 7076); Abcam: anti-phospho-IRE1α (48187);
MBL: anti-ATG16L1 (M150-3; 1F12); Stressgen: anti-haem-oxygenase-1
(ADI-SPA-895); Novus Biologicals: anti-GRP78 (NBP1-06274); Santa Cruz
Biotechnology: anti-ATF4 (sc-200; C20). Immunoprecipitation antibody: anti-IRE1α
(Santa Cruz Biotechnology, 20790; H190). Immunohistochemistry antibodies:
Santa Cruz Biotechnology: anti-lysozyme (27958; C19); MBL: anti-ATG16L1
(M150-3; 1F12); Cell Signaling Technology: anti-ATG16L1 (8089; D6D5),
anti-phospho-IκBα (2859; 14D4), anti-phospho-eIF2α (3597; 119A11); Abcam:
anti-Ki67 (15580), anti-GRP78 (21685); Progen: anti-p62 (GP62-C); BD Bioscience:
anti-BrdU (551321).

The following reagents were used: TNF (Peprotech, 315-01A), bafilomycin A1
(Sigma Aldrich, B1793), rapamycin (LC Laboratories, R-5000), JNK inhibitor
(Sigma Aldrich, SP600125), NF-κB inhibitor (Calbiochem, BAY11-7082) and
salubrinal (Alexis Biochemicals, ALX-270-428), dissolved in DMSO as
recommended. N-acetyl cysteine (Sigma Aldrich, A9165) and glutathione (Calbiochem,
NOVG3541) were used at a final concentration of 1 mM. Ambion siRNA for Atg16l1
(94892, sense 5′-GAACUGUUAGGGAAGAUCATT-3′, antisense
5′-UGAUCUUCCCUAACAGUUCCA-3′), Perk (65405, sense
5′-CCCGAUAUCUAACAGAUUUTT-3′, antisense 5′-AAAUCUGUUAGAUAUCGGGAT-3′;
65406, sense 5′-CGAAGAAUACAGUAAUGGUTT-3′, antisense
5′-ACCAUUACUGUAUUCUUCGTG-3′), Gadd34 (70230, sense
5′-CCAUAGCUCCGGGAUACAATT-3′, antisense 5′-UUGUAUCCCGGAGCUAUGGAA-3′;
70231, sense 5′-AGACAACAGCGAUUCGGAUTT-3′, antisense
5′-AUCCGAAUCGCUGUUGUCUTC-3′), Ern1 (95857, sense
5′-GUUUGACCCUGGACUCAAATT-3′, antisense 5′-UUUGAGUCCAGGGUCAAACTT-3′;
95858, sense 5′-GGAUGUAAGUGACCGAAUATT-3′, antisense
5′-UAUUCGGGUCACUUACAUCCTG-3′; 95859, sense
5′-GCUCGUGAAUUGAUAGAGATT-3′, antisense 5′-UCUCUAUCAAUUCACGAGCAA-3′)
and scrambled control were used at a final concentration of 10 μM.

Chromatin immunoprecipitation. ChIP with anti-ATF4 (Santa Cruz Biotechnology)
and control IgG rabbit antibody was performed in Xbp1 and control silenced
MODE-K cells according to the ChIP protocol by Agilent. To determine the presence
of ATF4 binding sites in the Atg7 promoter, a 4-kb region proximal to the
transcription start site identified with the Eukaryotic Promoter Database Primers was
analysed using MatInspector (Genomatix). Immunoprecipitated DNA was
subjected to quantitative PCR (qPCR) to determine enrichment of ATF4 binding to
respective promoters and results were normalized to input chromatin DNA.
Primers used for qPCR were as described for Map1lc3b (ref. 13); for Atg7, forward
5′-GCGCTTCCGCGTTTGTGTGG-3′ and reverse
5′-CTGCTCCGCAACCACGGCTT-3′.

Salubrinal, rapamycin and BAY11-7082 treatment in vivo. Salubrinal
(1 mg kg^−1 d^−1), rapamycin (1.5 mg kg^−1 d^−1) or vehicle (DMSO) was administered
intraperitoneally (i.p.) 24 h before the first tamoxifen administration to
GFP-LC3;V-creER^T2−;Xbp1^fl/fl (GFP-LC3;wild type) and GFP-LC3;Xbp1^T-ΔIEC mice.
A 3-day treatment was used for evaluation of accumulation of GFP-LC3 punctae
in the intestinal epithelium, whereas a 5-day combined tamoxifen and salubrinal
treatment followed by two daily salubrinal injections was used in experiments
with enteritis assessment as an end point (Extended Data Figs 2a and 3g). To
assess the effects of rapamycin on ER stress-induced intestinal inflammation in
XBP1 deficiency, V-cre;Xbp1^fl/fl (Xbp1^ΔIEC), V-cre;Atg16l1^fl/fl;Xbp1^fl/fl
(Atg16l1/Xbp1^ΔIEC) or V-cre;Atg7^fl/fl;Xbp1^fl/fl (Atg7/Xbp1^ΔIEC) and the
respective control mice were treated
with rapamycin or vehicle for 14 consecutive days i.p. and inflammation was
evaluated. BAY11-7082 or vehicle (DMSO) was administered i.p. every other
day at 5 mg kg^−1 for 14 consecutive days in Xbp1^ΔIEC mice, or for 5 consecutive
days at 20 mg kg^−1 in Xbp1^T-ΔIEC mice concomitant with i.p. tamoxifen.

Transmission electron microscopy. Small intestinal tissue from mice was handled
by standard methods to be fixed with 1.25% glutaraldehyde, 4% formaldehyde in
0.1 M cacodylate buffer at pH 7.4 at room temperature for electron microscopy. The
detailed procedures for electron microscopy were previously described and the
tissue was observed with a JEOL 1400 transmission electron microscope at 120 kV
operating voltage. For quantification of autophagy, the number of autophagic vacuoles
was manually counted by a TEM expert (J.H.) blinded to sample identity in 10
consecutive Paneth cells per sample. ImageJ software was used to measure the average
size of autophagic vacuoles.

Histology. Formalin-fixed and paraffin-embedded intestinal tissue was sectioned
and stained with haematoxylin and eosin as previously described. A semi-quantitative
composite scoring system was used for the assessment of spontaneous intestinal
inflammation, computed as a sum of five histological subscores, multiplied by a
factor based on the extent of the inflammation. Histological subscores (for each
parameter: 0, absent; 1, mild; 2, moderate; 3, severe): mononuclear cell infiltrate
(0-3), crypt hyperplasia (0-3), epithelial injury/erosion (0-3), polymorphonuclear
cell infiltrates (0-3) and transmural inflammation (0, absent; 1, submucosal; 2, one
focus extending into muscularis and serosa; 3, up to five foci extending into
muscularis and serosa; 4, diffuse). Extent factor was derived according to the fraction of
bowel length involved by inflammation: 1, <10%; 2, 10-25%; 3, 25-50%; and
4, >50%. Ileal inflammation was assessed by an expert gastrointestinal pathologist
(J.N.G.) who was blinded to the genotype and experimental conditions of the
samples. No spontaneous colonic inflammation was detected in any of the reported
genotypes.
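
As an illustration of how the composite enteritis score described above is assembled, the following minimal sketch sums the five subscores and multiplies by the extent factor. The function and argument names are hypothetical, and the handling of values that fall exactly on a bin boundary (10%, 25%, 50%) is an assumption, because the published ranges overlap at those points.

```python
def extent_factor(fraction_involved):
    """Map the fraction of bowel length involved (0-1) to the extent factor.

    Published bins: 1, <10%; 2, 10-25%; 3, 25-50%; 4, >50%.
    Assumption: a value exactly on a boundary is assigned to the lower bin.
    """
    if not 0.0 <= fraction_involved <= 1.0:
        raise ValueError("fraction_involved must be between 0 and 1")
    if fraction_involved < 0.10:
        return 1
    if fraction_involved <= 0.25:
        return 2
    if fraction_involved <= 0.50:
        return 3
    return 4


def enteritis_score(mononuclear, crypt_hyperplasia, epithelial_injury,
                    polymorphonuclear, transmural, fraction_involved):
    """Composite score = (sum of five subscores) x extent factor."""
    for value in (mononuclear, crypt_hyperplasia, epithelial_injury,
                  polymorphonuclear):
        if value not in (0, 1, 2, 3):
            raise ValueError("subscores range from 0 to 3")
    if transmural not in (0, 1, 2, 3, 4):
        raise ValueError("transmural subscore ranges from 0 to 4")
    subscore_sum = (mononuclear + crypt_hyperplasia + epithelial_injury
                    + polymorphonuclear + transmural)
    return subscore_sum * extent_factor(fraction_involved)


# Hypothetical animal: moderate mononuclear infiltrate, mild changes
# elsewhere, one transmural focus, 30% of bowel length involved.
example_score = enteritis_score(2, 1, 1, 1, 2, 0.30)  # (2+1+1+1+2) x 3
max_score = enteritis_score(3, 3, 3, 3, 4, 0.80)       # (3+3+3+3+4) x 4
print(example_score, max_score)
```

Under this scheme the score ranges from 0 (no inflammation) to 64 (all subscores maximal with more than half of the bowel length involved).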

Reactive oxygen species, cell death detection by flow cytometry and NF-κB
activity assays. Stable clones of the Xbp1-silenced (shXbp1) small intestinal
epithelial cell line MODE-K were generated via a lentiviral shRNA vector and selected
by hygromycin, as previously described. To evaluate oxidative stress, Xbp1-
and control-silenced MODE-K cells were incubated with 5 μM
5-(and-6)-chloromethyl-2',7'-dichlorodihydrofluorescein diacetate, acetyl ester (CM-H2DCFDA)
(Molecular Probes) for 30 min. After washing with PBS, cells were further
incubated with complete medium for 2 h. Reactive oxygen species generation was
determined using flow cytometry. To evaluate cell death, shXbp1 and shCtrl cells
were co-silenced for Atg16l1 using siRNA or scrambled control (Ambion). After
4 days, cells were collected and stained for annexin V (Biolegend) in staining buffer
(Biolegend) and mode of cell death was determined by flow cytometry after
addition of propidium iodide (PI). To assess NF-κB signalling pathway activation,
Xbp1- or control-silenced MODE-K cells were stimulated with 50 ng ml^−1 TNF
for indicated periods of time, followed by immunoblotting (using NE-PER
(Thermo Scientific) isolated cytoplasmic extracts), qRT-PCR, and
chemiluminescent detection of NF-κB consensus sequence binding activity (in nuclear extracts
isolated with NE-PER) with the NF-κB p65 transcription factor assay kit (Thermo
Scientific). Ern1 or scrambled siRNA (Ambion) was used for co-silencing as indicated.

Immunohistochemistry, BrdU and TUNEL labelling. Formalin-fixed
paraffin-embedded sections were stained according to standard immunohistochemistry
protocols and manufacturer’s recommendations as described previously. Cell
death was assessed by TdT-mediated dUTP nick end labelling (TUNEL) of
formalin-fixed paraffin-embedded slides of the respective genotypes using the TUNEL
cell death detection kit (Roche). Entire slides were analysed for TUNEL+ cells and
numbers normalized to intestinal length on the slide. Proliferation of the intestinal
epithelium was assessed after a 24-h pulse with 5-bromodeoxyuridine (BrdU; BD
Pharmingen) and incorporated BrdU was detected by the BrdU in situ detection
kit (BD Pharmingen). BrdU+ nuclei per total intestinal epithelial cells along the
crypt-villus axis are shown. Toluidine blue staining and Periodic acid-Schiff (PAS)
reaction were performed according to standard protocols.

Confocal microscopy for detection of GFP-LC3 and EYFP. For detection of 
GFP-LC3 or EYFP, mice were euthanized, followed by transcardiac perfusion with 
PBS (2-3 min) and 3.7% formaldehyde (3-4 min). Small intestine was dissected 
and promptly washed with PBS. The tissue was fixed in formalin for an additional 
12-18 h. Fixed tissue was embedded in OCT and sectioned on a cryotome into
5 μm sections. Slides were washed with PBS and mounted with Prolong Gold
Antifade reagent with DAPI (Invitrogen). Images of the sections were collected
using an Olympus semi-confocal system. MetaMorph software was used for image
analysis. For the detection of autophagosome formation in vitro, Xbp1 and control
silenced MODE-K cells were transfected with a GFP-LC3 plasmid (gift from
N. Mizushima) with use of Lipofectamine LTX (Invitrogen) following the
manufacturer’s instructions. Accumulation of GFP-LC3 punctae or EYFP signal was
assessed using LSM510 META confocal microscopy (Carl Zeiss).

Intestinal epithelial cell purification and crypt isolation. Mice were euthanized 
and the intestine was washed with ice-cold PBS after being cut open longitudinally. 
Peyer’s patches were removed and the intestine was cut into small pieces. Mucus 
was removed by shaking the intestine in 1× HBSS containing 1 mM DTT for
10 min at room temperature. After washing with PBS, pieces were digested with
dispase (1 U ml^−1 in RPMI with 2% FCS) for 30 min at 37 °C with shaking
(250 r.p.m.). Cells were collected and debris removed with a 100 μm cell strainer,
and centrifuged for 5 min at 1,500 r.p.m. Intestinal epithelial cells were collected in
the top layer after 40-100% Percoll gradient centrifugation. Purity of the
population was determined by staining with anti-EpCAM antibody and flow cytometry
analysis. Intestinal epithelial cells were lysed with RIPA buffer and equal amounts
of protein were used for western blot analysis as indicated in figure legends. To
isolate small intestinal crypts, the intestine was flushed, cut open longitudinally
and incubated on ice for 30 min in 2 mM EDTA/PBS. Two sedimentation steps
and application of a cell strainer separated crypts from villi, which were then used
for RNA isolation (Qiagen), Xbp1 splicing assay, and for protein lysis with RIPA
buffer and subsequent immunoblotting. 

Intestinal epithelial scrapings. Mice were euthanized, intestines collected and 
longitudinally opened, and immediately washed with ice-cold PBS. Intestinal 
epithelium was collected by scraping with glass slides and snap frozen in liquid
nitrogen for further analysis. For protein analysis, intestinal epithelial scrapings
were homogenized in RIPA buffer using a 25G needle with a syringe. Lysates were
cleared by centrifugation and aliquots of protein were used for protein assessment
using standard western blot or immunoprecipitation protocols as indicated.
Isolation of mRNA from intestinal epithelial scrapings or MODE-K lysates and
RT-qPCR was performed as described using the following pairs of primers: grp78
5′-ACTTGGGGACCACCTATTCCT-3′ and 5′-ATCGCCAATCAGACGCTCC-3′;
Gadd34 5′-CCCGAGATTCCTCTAAAAGC-3′ and 5′-CCAGACAGCAAGGAAATGG-3′;
LC3b 5′-GCGCCATGCCGTCCGAGAAG-3′ and 5′-GCTCCCGGATGAGCCGGACA-3′;
Xbp1 5′-AGCAGCAAGTGGTGGATTTG-3′ and 5′-GAGTTTTCTCCCGTAAAAGCTGA-3′;
Atg7 5′-CCTTCGCGGACCTAAAGAAGT-3′ and 5′-CCCGGATTAGAGGGATGCTC-3′;
Nfkbia 5′-TGAAGGACGAGGAGTACGAGC-3′ and 5′-TTCGTGGATGATTGCCAAGTG-3′;
β-actin 5′-GACGGCCAGGTCATCACTATTG-3′ and 5′-AGGAAGGCTGGAAAAGAGCC-3′.

Dextran sodium sulphate induced colitis. Acute colitis was induced by adding 
4% DSS (TdB Consultancy) to drinking water ad libitum for five consecutive days. 
Daily disease activity index (DAI) was assessed evaluating weight loss, stool
consistency and rectal bleeding according to Supplementary Table 1. A high resolution
mouse endoscopic system (Hopkins) was used. Colitis severity was assessed by a
semi-quantitative score consisting of two subscores: endoscopic tissue damage
(0-3, where 0 is no damage, 1 is lymphoepithelial lesions, 2 is surface mucosal
erosion or focal ulceration, and 3 is extensive mucosal damage with expansion into
deeper structures of the bowel wall) and inflammatory infiltration (0-3, where 0 is
occasional inflammatory cells in the lamina propria, 1 is increased numbers of
inflammatory cells in lamina propria, 2 is confluence of inflammatory cells
extending into the submucosa, and 3 is transmural extension of the infiltrate).

Xbp1 splicing assay and densitometric quantification. Xbp1 splicing was 
assessed as described previously. Briefly, RNA was isolated, reverse transcribed
and amplified by RT-PCR with the following primers: Xbp1 sp. forward
5′-ACACGCTTGGGAATGGACAC-3′; Xbp1 sp. reverse
5′-CCATGGGAAGATGTTCTGGG-3′. The PCR products of 171 bp (unspliced) and
145 bp (spliced) were resolved on a 2% agarose gel. Densitometric analysis for
splicing assay and immunoblots was performed with ImageJ.

Statistical methods. Statistical significance was calculated as appropriate using an 
unpaired two-tailed Student’s t-test or a Mann-Whitney U-test and considered 
significant at P < 0.05. In experiments where more than two groups were compared, 
Kruskal-Wallis test followed by Mann-Whitney U-test and post-hoc Bonferroni-Holm
correction or one-way ANOVA/Bonferroni was performed. Grubbs’ test
was used as appropriate to identify outliers. Data were analysed using GraphPad
Prism software. Experimental group sizes were based on the goal of achieving desired 
effect sizes typically of =2.0 standard deviations and a power of 0.9 on the assump- 
tion of a normal distribution, and therefore typically involved n = 6-10. 
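
The stated group sizes can be related to the stated effect size and power with a standard normal-approximation formula for comparing two means. The sketch below is an assumption about how such a calculation might be done, not the authors' documented procedure; the two-sided significance level of 0.05 is taken from the significance threshold above, and the function name `n_per_group` is hypothetical.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(effect_size_sd, power=0.9, alpha=0.05):
    """Approximate animals per group for a two-sided, two-sample comparison.

    Normal approximation: n = 2 * ((z_{1-alpha/2} + z_{power}) / d) ** 2,
    where d is the expected difference in units of standard deviations.
    """
    standard_normal = NormalDist()
    z_alpha = standard_normal.inv_cdf(1 - alpha / 2)
    z_power = standard_normal.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / effect_size_sd) ** 2)


# Effect sizes of 2.0 and 1.5 standard deviations at a power of 0.9.
n_large_effect = n_per_group(2.0)
n_smaller_effect = n_per_group(1.5)
print(n_large_effect, n_smaller_effect)
```

Under these assumptions an effect of 2.0 standard deviations requires about 6 animals per group and an effect of 1.5 standard deviations about 10, which is in line with the reported range of n = 6-10. An exact t-distribution-based calculation would give slightly larger numbers.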
Extended Data Figure 1 | Autophagy induction in Xbp1-deficient intestinal
epithelial cells. a, Xbp1 expression in shCtrl and shXbp1 MODE-K cells
(n = 7/6; unpaired Student’s t-test; mean ± s.e.m.). b, c, Immunoblot of shCtrl
and shXbp1 MODE-K cells. d, Immunoblot after autophagosome-lysosome
fusion inhibition via bafilomycin in silenced MODE-K intestinal epithelial cells.
e, Silenced MODE-K intestinal epithelial cells after GFP-LC3 reporter
transfection (scale bars, 5 μm) with green punctae per cell quantification in
f (n = 14; unpaired Student’s t-test; mean ± s.e.m.). g, TEM of shCtrl and
shXbp1 cells. Note double-membraned structure with engulfed contents
characteristic of autophagosomes (white arrows; n = 10). Scale bars, 0.5 μm.
h, Quantification of occupied area and average size of autophagic vacuoles from
g (n = 10; unpaired Student’s t-test; mean ± s.e.m.). i, j, Densitometry of Fig. 1a
(i; n = 5/4; unpaired Student’s t-test; mean ± s.e.m.) and Fig. 1b (j; n = 3;
unpaired Student’s t-test; mean ± s.e.m.). k, Low-magnification (×1,380,
original magnification here and in the remainder of this legend) TEM image of
Paneth cells from wild-type mice (1), demonstrating the abundant endoplasmic
reticulum (ER) and characteristic secretory granules at the apical, lumenally
(L) oriented side. (2) Higher magnification (×5,520) of inset ‘2’ in
(1) demonstrating typical secretory granules in Paneth cells from wild-type
mice. (3) High magnification (×20,700) of inset ‘3’ in (1) illustrating a double
membrane structure characteristic of an autophagosome (white arrow) in close
proximity to the ER and a mitochondrion. (4) Low magnification (×2,160)
TEM image of Paneth cell remnants present in Xbp1^ΔIEC small intestinal crypts,
which lack expansion of the ER and exhibit only minuscule granule remnants.
(5) Higher magnification (×9,000) of inset ‘5’ in (4), demonstrating degradative
autophagic vacuoles (black arrows), in close proximity to mitochondria, and
the virtual absence of ER membranes. (6) High-power (×14,400) magnification
of inset ‘6’ in (4), illustrating a double-membrane structure (white arrow)
characteristic of autophagosomes, and a degradative autophagic vacuole (black
arrow). ER, endoplasmic reticulum; L, lumen; M, mitochondrion; N, nucleus.
Scale bars represent 2 μm, 0.5 μm and 200 nm, respectively. Results represent
three (b, c) or two (d, k) independent experiments. *P < 0.05, ***P < 0.001.
Extended Data Figure 2 | Autophagy in Xbp1-deficient intestinal epithelial
cells is induced by eIF2α phosphorylation but independent of JNK or
oxidative stress pathways. a, Timeline for salubrinal experiment shown in
Fig. 1j. b, Representative indirect immunofluorescence images of GFP-LC3
punctae accumulation (green) in small intestinal sections co-stained with
anti-lysozyme antibody (red) treated as in a (n = 3). DAPI, blue. Scale bars, 10 μm.
c, Immunoblot of crypt lysates after 3 days of tamoxifen or vehicle
administration (n = 3). d, e, Immunoblot of primary intestinal epithelial cell
scrapings (d); densitometry in e (n = 3; unpaired Student’s t-test;
mean ± s.e.m.). f, g, Immunoblot of shXbp1 and shCtrl MODE-K cells. ER
stress-induced Jun N-terminal kinase-1 (JNK1) has previously been connected
in other cellular model systems to autophagy activation through
phosphorylation of B-cell leukaemia 2 (Bcl-2) and its dissociation from beclin
1, as have oxidative stress/free radicals and haem oxygenase-1 (HO-1)
activation. h, Intracellular ROS determined by dichlorofluorescein assay and
mean fluorescent intensity (MFI) after vehicle or dichlorofluorescein diacetate
(DCF-DA) treatment. i, Immunoblot of shXbp1 and shCtrl MODE-K cells after
administration of the JNK inhibitor SP600125 (0, 5 or 25 μM) for 4 h. Note the
absence of an effect of SP600125 treatment on the conversion of LC3-I to
LC3-II or the levels of p-eIF2α, thereby excluding a major contribution
of the JNK pathway to autophagy induction in the presence of
intestinal-epithelial-cell-associated XBP1 deficiency. j, Immunoblot of shXbp1
and shCtrl MODE-K cells after N-acetylcysteine (NAC), glutathione (GSH) or
vehicle treatment for 16 h. Note the absence of an effect of the free radical
scavengers on either of these markers of UPR-induced autophagy (LC3-II or
p-eIF2α). Results represent three (f-j) independent experiments. *P < 0.05.
Extended Data Figure 3 | ER stress-induced activation of PERK/eIF2α
induces autophagy in Xbp1-deficiency. a, Immunoblot of shXbp1 and shCtrl
MODE-K cells co-silenced with Perk (siPerk) or scrambled siRNA.
b, Cumulative densitometry of the immunoblot in Fig. 1g and two additional
experiments (not shown, n = 10; unpaired Student’s t-test; mean ± s.e.m.).
c, p-eIF2α immunohistochemistry of small intestinal epithelium. Scale bars,
50 μm. d, mRNA expression levels of Map1lc3b (LC3b) and Atg7 relative to
Gapdh in shCtrl and shXbp1 MODE-K cells (n = 3; unpaired Student’s t-test;
mean ± s.e.m.). e, Accumulation of GFP-LC3 punctae after salubrinal and
3-day tamoxifen treatment according to timeline in Extended Data Fig. 2a
(n = 5). Scale bars, 5 μm. f, Immunoblot of primary epithelial cell scrapings
from small intestine upon vehicle or salubrinal treatment according to the
timeline in g (n = 2). Note the increased relative levels of LC3 conversion and
CHOP, a transcriptional target of ATF4, in salubrinal-treated mice. g, Timeline
of salubrinal experiment. Results are reported in f and Fig. 1k. h, Gadd34
expression in shCtrl and shXbp1 MODE-K cells co-silenced for Gadd34
(siGadd34) and control (siCtrl) (one-way ANOVA with post-hoc Holm’s
corrected unpaired Student’s t-test; mean ± s.e.m.). i, Cells as in h were
immunoblotted. j, Immunoblot of crypt lysates (n = 2/4/2). Results represent
three (a, h, i) independent experiments. *P < 0.05, **P < 0.01, ***P < 0.001.
Extended Data Figure 4 | Absence of Atg7 in intestinal epithelium disrupts 
autophagy and exacerbates spontaneous enteritis of Xbp1 deficiency and 
development of conditional Atg16l1 knockout mouse. a, Immunoblot of 
primary intestinal epithelial cell scrapings (n = 4/3/3/3). Note the loss of ATG5, 
ATG12 and ATG16L1 proteins in ATG7-deficient mice. b, Representative high 
magnification TEM images (n = 2). Note intact ER in wild-type mice; severe 
distortion of ER in Atg7ΔIEC mice; autophagic vacuoles with the characteristic 
double membrane (white arrow) in Xbp1ΔIEC mice; and the absence of ER and 
autophagic vacuoles in Atg7/Xbp1ΔIEC mice. Scale bars, 200 nm. ER, 
endoplasmic reticulum. c, Quantification of autophagic vacuoles (n = 10; 
unpaired Student’s t-test; mean ± s.e.m.). d, High-magnification (×400) 
haematoxylin and eosin images. Note knife-like extension of inflammation 
along blood vessels through whole thickness of muscularis propria into the 
serosa in Atg7/Xbp1ΔIEC mice. Scale bars, 10 µm. e, Linear regression analysis 
for the correlation of inflammation with age. Each dot represents a single 
animal (R² and P value for deviation from 0 are shown). f, Deep inflammation 
score (as described in Methods) in Atg7/Xbp1ΔIEC mice plotted at indicated ages 
in weeks (n = 13/17/9; median shown; Kruskal-Wallis with post-hoc 
Holm’s-corrected Mann-Whitney U-test). g, Atg16l1 
intestinal-epithelial-cell-specific knockout design. Exon 1 of the Atg16l1 gene 
was flanked by two loxP sites and Atg16l1 deletion was mediated by Cre 
recombinase (Cre) under control of the Villin (V-) promoter 
(V-cre+;Atg16l1fl/fl or Atg16l1ΔIEC). See 
Methods for a detailed description. h, Immunohistochemical staining (brown) 
with anti-ATG16L1 antibody (MBL) (n = 2). Note cytoplasmic 
intestinal-epithelial-cell-specific ATG16L1 immunostaining in wild-type but not 
Atg16l1ΔIEC mice with retained ATG16L1 expression in lamina propria 
mononuclear cells in Atg16l1ΔIEC mice. Scale bars, 50 µm. i, Indirect 
immunofluorescence microscopy (green) with anti-ATG16L1 antibody 
(Cell Signaling Technology) (n = 2). Note the similar staining pattern as in 
h with cytoplasmic intestinal-epithelial-cell-specific ATG16L1 
immunoreactivity in wild-type but not Atg16l1ΔIEC mice and retained 
ATG16L1 expression in lamina propria mononuclear cells in Atg16l1ΔIEC mice. 
Dashed line denotes crypt unit. Scale bars, 20 µm. j, Schematic representation of 
lysozyme+ granule allocation patterns (green): normal (D0), disordered (D1), 
depleted (D2), and diffuse (D3). k, l, Lysozyme immunofluorescence (green) 
in crypts of Atg16l1ΔIEC and wild-type mice (k), quantified in l (n = 3; unpaired 
Student’s t-test; mean ± s.e.m.) according to granule allocation patterns 
shown in j. Dashed line denotes crypt unit. DAPI, blue. Scale bars, 5 µm. 
m, n, Quantification of Paneth cells based on the number of granular vesicles 
stained with toluidine blue (m) as shown in n (n = 4; unpaired Student’s t-test; 
mean ± s.e.m.). Scale bars, 5 µm. *P < 0.05, **P < 0.01, ***P < 0.001. 
Extended Data Figure 5 | Atg16l1 intestinal-epithelial-cell-specific deletion 
abrogates UPR-induced autophagy and exacerbates ER stress-induced 
spontaneous intestinal inflammation due to Xbp1 deficiency. 
a, Immunoblot of primary intestinal epithelial cell scrapings. Note identical 
GAPDH loading control as in Fig. 3b (n = 3). b, Representative 
high-magnification TEM images (n = 2). Note intact ER in wild-type mice; severe 
distortion of ER in Atg16l1ΔIEC mice; autophagic vacuoles with the 
characteristic double membrane (arrow) in Xbp1ΔIEC mice; and absence of ER 
and autophagic vacuoles in Atg16l1/Xbp1ΔIEC mice. Scale bar, 200 nm. ER, 
endoplasmic reticulum. c, Quantification of autophagic vacuoles (n = 10; 
unpaired Student’s t-test; mean ± s.e.m.). d, Representative images of p62 
immunostaining on small intestinal sections (n = 3). Brown staining indicates 
p62 with maximal signal intensity in the region of Paneth cells. Scale bars, 
20 µm. e, High-resolution haematoxylin and eosin images. Note the extension 
of the inflammation into muscularis propria (arrow) in Atg16l1/Xbp1ΔIEC mice. 
Scale bars, 20 µm. f, Deep inflammation score (as described in Methods) in 
mice assessed at 18 weeks of age (n = 11; median shown; Kruskal-Wallis with 
post-hoc Dunn’s correction). g, Enteritis histology score (n = 14; 
7-10-week-old mice housed in Innsbruck; median shown; Kruskal-Wallis with 
post-hoc Holm’s-corrected Mann-Whitney U-test). h, qRT-PCR of grp78 in primary 
intestinal epithelial cell scrapings (n = 7; unpaired Student’s t-test; 
mean ± s.e.m.). i, Disease activity index during DSS colitis (n = 7/4, two-way 
ANOVA with post-hoc Bonferroni; mean ± s.e.m.). j, Representative 
endoscopic images from the colon at day 5 of DSS treatment (n = 3). 
k, Representative haematoxylin and eosin staining of colonic sections from 
5-day DSS-treated and untreated mice (n = 7/4). Scale bars, 50 µm. l, Colitis 
score after 5 days of DSS administration (n = 7/4, Mann-Whitney U-test). 
*P < 0.05, **P < 0.01, ***P < 0.001. 
Extended Data Figure 6 | ER stress-induced enteritis of Xbp1 deficiency 
exacerbated by autophagy impairment correlates with age and cell death in 
intestinal epithelial cells and is dependent on IRE1α-mediated NF-κB 
activation. a, Representative images of TUNEL-labelled intestinal epithelium 
(brown) (n = 7). Scale bars, 50 µm. b, Correlation of TUNEL+ cells with the 
severity of inflammation. Linear regression analysis of Atg16l1/Xbp1ΔIEC (left) 
and Atg7/Xbp1ΔIEC (right) with significant R² and P value for deviation from 
zero shown (left panel, n = 5/5/7/7; right panel, n = 5). c, Linear least square 
regression analysis for the correlation of enteritis histology score with cell 
death and age of animals by genotype. Each dot represents a single animal (grey, 
wild type; yellow, Atg16l1ΔIEC; blue, Xbp1ΔIEC; red, Atg16l1/Xbp1ΔIEC mice) and 
the plane represents the linear regression for the enteritis histology score as a 
function of age and TUNEL labelling for Atg16l1/Xbp1ΔIEC mice 
(n = 6/13/12/12). Note that the severity of inflammation significantly correlates 
with numbers of TUNEL+ intestinal epithelial cells and age only in 
Atg16l1/Xbp1ΔIEC mice (R² = 0.602, P = 0.016). The three-dimensional (3D) plot is also 
available online in video format (Supplementary Video 1). Regression analysis 
was performed using the R package lessR 
(http://cran.r-project.org/web/packages/lessR/index.html), last accessed 
May 2013. d, shCtrl or shXbp1 
MODE-K cells were co-silenced for Atg16l1 (siAtg16l1) or with scrambled 
siRNA (siCtrl), and analysed by flow cytometry for annexin V and propidium 
iodide (PI) uptake (one-way ANOVA with post-hoc Bonferroni; 
mean ± s.e.m.). e, Densitometry of the immunoblot shown in Fig. 3b (n = 3; 
one-way ANOVA with post-hoc Holm’s-corrected unpaired Student’s t-test; 
mean ± s.e.m.). f, Immunohistochemical staining for p-IκBα in the small 
intestinal epithelium (n = 3). Scale bars, 20 µm. g, NF-κB consensus sequence 
binding assay after stimulation of shCtrl and shXbp1 MODE-K cells for 5 and 
20 min with indicated concentrations of TNF. h, Nfkbia expression, a 
prototypic NF-κB-transactivated gene, after TNF stimulation of shCtrl and 
shXbp1 MODE-K cells (mean ± s.e.m.). i, Representative images of 
TUNEL-labelled sections after administration of BAY11-7082 or vehicle 
(n = 3/4/4). Scale bars, 50 µm. j, Enteritis histology score of mice treated with 
the NF-κB inhibitor BAY11-7082 or vehicle (n = 7/9/8; median shown; 
Kruskal-Wallis with post-hoc Holm’s-corrected Mann-Whitney U-test). 
k, Expression of Nfkbia in Ern1- and control-silenced shXbp1 and control 
MODE-K cells after TNF stimulation (mean ± s.e.m.). Results represent three 
(h, k) or two (d, g) independent experiments. *P < 0.05, **P < 0.01, 
***P < 0.001. 
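
The regression in panel c models the enteritis histology score as a linear function of age and TUNEL labelling, and was performed with the R package lessR. As an illustration only, the following Python sketch fits the same kind of plane by ordinary least squares and reports R². It is not the authors' analysis; the function names are hypothetical and the values in the usage example are invented, not data from this study.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Swap in the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def fit_plane(ages, tunel_counts, scores):
    """Fit score = b0 + b1*age + b2*tunel by least squares.

    Returns ([b0, b1, b2], r_squared).
    """
    rows = [[1.0, a, t] for a, t in zip(ages, tunel_counts)]
    # Normal equations: (X^T X) b = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, scores)) for i in range(3)]
    coef = solve_linear_system(xtx, xty)
    fitted = [sum(c * v for c, v in zip(coef, r)) for r in rows]
    mean_y = sum(scores) / len(scores)
    ss_res = sum((y - f) ** 2 for y, f in zip(scores, fitted))
    ss_tot = sum((y - mean_y) ** 2 for y in scores)
    return coef, 1.0 - ss_res / ss_tot


# Hypothetical example values (age in weeks, TUNEL+ cells, histology score).
ages = [8, 10, 12, 14, 18, 20]
tunel = [1, 4, 2, 6, 3, 7]
scores = [6, 15, 12, 22, 18, 28]
coefficients, r_squared = fit_plane(ages, tunel, scores)
```

A dedicated statistics package would additionally report the P value for the fit, which this sketch omits.
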
Extended Data Figure 7 | Rapamycin induces autophagy in Paneth cells, 
reduces the severity of enteritis and the number of TUNEL+ intestinal 
epithelial cells in Xbp1ΔIEC but not in Atg7/Xbp1ΔIEC or Atg16l1/Xbp1ΔIEC 
mice. a, b, Quantification of GFP-LC3 punctae accumulation after 3-day 
tamoxifen treatment and induction by rapamycin (a; n = 10; one-way ANOVA 
with post-hoc Bonferroni; mean ± s.e.m.) and representative images in 
b (n = 3). Scale bars, 5 µm. c, Densitometry of the immunoblot in Fig. 4a (n = 3; 
one-way ANOVA with post-hoc Holm’s-corrected unpaired Student’s t-test; 
mean ± s.e.m.). d, Quantification of TUNEL+ intestinal epithelial cells per cm 
of gut in mice treated with rapamycin or vehicle (n = 5; unpaired Student’s 
t-test; mean ± s.e.m.). e, Representative images of TUNEL-labelled sections 
from rapamycin- or vehicle-treated mice. Scale bars, 50 µm. f, g, Quantification 
of TUNEL+ intestinal epithelial cells per cm of gut in mice treated with 
rapamycin or vehicle (f; n = 4/4/5/5/5/5; g; n = 5; one-way ANOVA with 
post-hoc Holm’s-corrected unpaired Student’s t-test; mean ± s.e.m.). 
h, i, Enteritis histology scores of rapamycin- or vehicle-treated mice of 
respective genotypes (h; n = 8/9/6/9/12/13; i, n = 8/7/6/7/7/6) (median shown; 
Kruskal-Wallis with post-hoc Holm’s-corrected Mann-Whitney U-test). 
*P < 0.05, **P < 0.01, ***P < 0.001. 
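
Several of the legends above report post-hoc tests with Holm's correction for multiple comparisons. For readers unfamiliar with the procedure, the sketch below shows the standard Holm step-down adjustment of a set of raw P values in plain Python. It illustrates the general method only and is not the software used in this study; the function name and the example P values are hypothetical.

```python
def holm_adjust(p_values):
    """Return Holm step-down adjusted P values in the original order.

    The smallest raw P value is multiplied by m, the next by m - 1, and so
    on; a running maximum keeps the adjusted values monotone, and every
    value is capped at 1.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


# Hypothetical raw P values from three pairwise comparisons.
raw = [0.01, 0.04, 0.03]
adjusted = holm_adjust(raw)
significant = [p < 0.05 for p in adjusted]
```

Holm's method controls the family-wise error rate like the Bonferroni correction but is never less powerful, which may be why it is favoured for small sets of pairwise comparisons.
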



Extended Data Figure 8 | Autophagy induction in Paneth cells of Xbp1ΔPC 
mice. a, Representative TEM images of small intestinal crypts from Defa6-cre* 
(wild type) and Defa6-cre+;Xbp1fl/fl (Xbp1ΔPC) mice. (1) Low magnification 
TEM image of Paneth cells at the base of a small intestinal crypt in wild-type 
mice, demonstrating abundant ER and characteristic secretory granules at the 
apical, lumenally oriented side. Scale bar, 5 µm. (2) Higher magnification of 
inset ‘2’ in (1) with numerous secretory granules. Scale bar, 1 µm. (3) High 
magnification of inset ‘3’ in (1) with intact ER. Scale bar, 200 nm. 
(4) Low-magnification TEM image of Paneth cells in an Xbp1ΔPC small 
intestinal crypt. Note the reduced number of secretory granules, the exfoliating 
hypomorphic Paneth cell into crypt lumen (‘E’, black dashed outline) and a 
transmigrating polymorphonuclear cell through the mucosa (white dashed 
outline). Scale bar, 5 µm. (5) Higher magnification of inset ‘5’ in (4), 
demonstrating abnormalities in secretory granule maturation with numerous 
hypodense granules (*) and a distorted ER. Note the accumulation of 
electron-dense cargo indicative of degradative autophagic vacuoles 
(black arrow) in close proximity to the nucleus. Scale bar, 1 µm. (6) High 
magnification of the inset ‘6’ in (4), demonstrating degradative autophagic 
vacuoles filled with electron-dense material (black arrow). Note the double 
membrane structures characteristic for autophagosomes (white arrowheads). 
Scale bar, 200 nm. (7) Low-magnification TEM image of Paneth cells in an 
Xbp1ΔPC small intestinal crypt. Note the disorganized ER, hypodense secretory 
granules (*), empty area after exfoliation of a Paneth cell into the crypt lumen 
(‘E’, black dashed outline) similar to image (4). (8) High magnification of inset 
‘8’ identified in (7), demonstrating an autophagic vacuole (black arrow) 
surrounded by a markedly distorted ER. Scale bar, 200 nm. (9) High 
magnification of the inset ‘9’ in (7), with additional examples of degradative 
autophagic vacuoles accompanied by a distorted ER. Representative of two 
independent experiments. Scale bar, 200 nm. ER, endoplasmic reticulum; 
L, lumen; M, mitochondrion; N, nucleus. b, Densitometry of the immunoblot 
in Fig. 4d (n = 2; unpaired Student’s t-test; mean ± s.e.m.). *P < 0.05. 
Extended Data Figure 9 | Phenotypic consequences of Xbp1 deletion in 
Paneth cells and characterization of murine norovirus status. a, b, 
TUNEL-labelled ileal sections (a) with quantification of TUNEL+ cells per cm gut 
shown in b (n = 7/11; unpaired Student’s t-test; mean ± s.e.m.). Scale bars, 
50 µm. c, d, Ki67 immunohistochemical staining of ileal sections (c) with 
quantification of Ki67+ cells per total intestinal epithelial cell count along the 
crypt-villus axis shown in d (n = 5; unpaired Student’s t-test; mean ± s.e.m.). 
Scale bars, 20 µm. e, f, BrdU-labelled ileal sections (e) after 24 h with 
quantification of BrdU+ cells per total intestinal epithelial cell count along the 
crypt-villus axis in f (n = 5/3; unpaired Student’s t-test; mean ± s.e.m.). Scale 
bars, 50 µm. g, h, PAS-stained ileal sections (g) with quantification of PAS+ 
cells per total intestinal epithelial cell count along the crypt-villus axis shown in 
h (n = 5; unpaired Student’s t-test; mean ± s.e.m.). Scale bars, 50 µm. 
i, Enteritis histology score of indicated genotypes (n = 3; median shown; 
Kruskal-Wallis). j, Taqman qRT-PCR for MNV of faecal samples in three 
different animal facilities (n = 3; mean ± s.e.m.). Note detectable MNV 
genome copy numbers in Boston (Atg7/Xbp1ΔIEC, Fig. 2c) and Innsbruck 
(Atg16l1/Xbp1ΔIEC, Extended Data Fig. 5g) colonies, and absence of MNV after 
re-derivation of colonies into the MNV-free Cambridge facility 
(Atg16l1/Xbp1ΔIEC, Fig. 2e), yet similar levels of enteritis in Xbp1ΔIEC and 
Atg16l1/Xbp1ΔIEC mice maintained in Innsbruck (MNV+) and Cambridge (MNV−) 
(Fig. 2b, e and Extended Data Fig. 5g) and relatively more severe disease in 
Xbp1ΔIEC mice from Boston (MNV+) (Fig. 2c). Sentinel mice from the 
Cambridge colony also tested negative for MNV by serological analysis (data 
not shown). These observations suggest that the inflammatory phenotypes 
observed are not necessarily dependent on MNV. *P < 0.05. 
Extended Data Figure 10 | Model of the interplay between UPR-induced 
autophagy, NF-κB signalling and inflammation. a, Deletion of Xbp1 in the 
intestinal epithelium, specifically in Paneth cells, leads to ER stress and 
activation of the PERK-eIF2α branch of the UPR. ATF4, a transcriptional 
mediator of this pathway, transactivates genes essential for autophagosome 
formation, such as Map1lc3b (LC3b) and Atg7, which catalyses the creation of 
the ATG12-ATG5 conjugate that stabilizes ATG16L1 through complex 
formation. UPR-induced autophagy in the intestinal epithelium is essential 
for restoration of homeostasis and restraint of ER stress-induced intestinal 
inflammation due to XBP1 deficiency. Activation of the UPR in the setting of 
XBP1 deficiency results in activation of IRE1α, resulting in the recruitment of 
TRAF2 and activation of IKK2 leading to IκBα degradation. As shown 
here, UPR-mediated autophagy serves an important role in restraining NF-κB 
activation, conceivably by removing hyperinflammatory ER membranes 
containing activated IRE1α. Pharmacological augmentation of this 
compensatory autophagy-dependent mechanism via inhibition of eIF2α 
dephosphorylation through salubrinal, or via the mTOR inhibitor rapamycin, 
results in amelioration of UPR-induced enteritis, which is driven by the 
commensal microbiota, NF-κB, and TNF-R1 signalling in intestinal epithelial 
cells and myeloid cells, whereby the ligand TNF may originate from 
XBP1-deficient intestinal epithelial cells. b, ATG16L1 deficiency in intestinal 
epithelial cells leads to ER stress as revealed through upregulation of the 
chaperone GRP78 in intestinal epithelial cells, increased expression of GRP78 
protein in Paneth cells, increased IRE1α expression and increased splicing of 
Xbp1 mRNA in intestinal crypts as well as increased intestinal epithelial cell 
death. This leads to increased sensitivity of the epithelium to environmental 
triggers (for example, dextran sodium sulphate) that further challenge the UPR 
and its compensatory pathways. c, Deficiency of ATG16L1 or ATG7 in the 
intestinal epithelium results in abrogation of the compensatory autophagic 
mechanism that restrains IRE1α activity, conceivably via removal of 
hyperinflammatory ER membranes, and further fosters intestinal epithelial cell 
death in the context of ER stress due to Xbp1 deficiency, resulting in 
spontaneous transmural small intestinal inflammation that is associated with 
further increases in NF-κB activation and cell death via the mechanisms 
described in a. The UPR allows for responses to a variety of signals that have an 
impact on protein folding, including genetic (for example, rare XBP1 variants, 
ORMDL3 as risk factor of inflammatory bowel disease), environmental 
(for example, low O2 tension in the intestinal tract) and microbial factors (for 
example, microbial toxins such as trierixin), which determines the level of ER 
stress in the intestinal epithelium. UPR-induced autophagy function provides 
a buffer to cope with different levels of ER stress and vice versa. However, in the 
presence of genetic risk variants, such as ATG16L1 (refs 1, 17, 49) or IRGM 
(ref. 50), which are relatively prevalent in the general population, this 
compensatory mechanism is impaired, resulting in development and/or 
exacerbation of intestinal inflammation in the setting of unabated ER stress. 
Antibody-mediated immunotherapy of macaques 
chronically infected with SHIV suppresses viraemia 


Masashi Shingai, Yoshiaki Nishimura, Florian Klein, Hugo Mouquet, Olivia K. Donau, Ronald Plishka, 
Alicia Buckler-White, Michael Seaman, Michael Piatak Jr, Jeffrey D. Lifson, Dimiter S. Dimitrov, Michel C. Nussenzweig 
& Malcolm A. Martin 


Neutralizing antibodies can confer immunity to primate lentiviruses 
by blocking infection in macaque models of AIDS. However, earlier 
studies of anti-human immunodeficiency virus type 1 (HIV-1) 
neutralizing antibodies administered to infected individuals or 
humanized mice reported poor control of virus replication and the 
rapid emergence of resistant variants. A new generation of 
anti-HIV-1 monoclonal antibodies, possessing extraordinary potency 
and breadth of neutralizing activity, has recently been isolated from 
infected individuals. These neutralizing antibodies target different 
regions of the HIV-1 envelope glycoprotein including the 
CD4-binding site, glycans located in the V1/V2, V3 and V4 regions, and 
the membrane proximal external region of gp41 (refs 9-14). Here 
we have examined two of the new antibodies, directed to the 
CD4-binding site and the V3 region (3BNC117 and 10-1074, respectively), 
for their ability to block infection and suppress viraemia in 
macaques infected with the R5 tropic simian-human immunodeficiency 
virus (SHIV)-AD8, which emulates many of the pathogenic 
and immunogenic properties of HIV-1 during infections of rhesus 
macaques. Either antibody alone can potently block virus acquisition. 
When administered individually to recently infected macaques, 
the 10-1074 antibody caused a rapid decline in virus load to 
undetectable levels for 4-7 days, followed by virus rebound during which 
neutralization-resistant variants became detectable. When administered 
together, a single treatment rapidly suppressed plasma viraemia 
for 3-5 weeks in some long-term chronically SHIV-infected 
animals with low CD4+ T-cell levels. A second cycle of anti-HIV-1 
monoclonal antibody therapy, administered to two previously treated 
animals, successfully controlled virus rebound. These results indicate 
that immunotherapy or a combination of immunotherapy 
plus conventional antiretroviral drugs might be useful as a treatment 
for chronically HIV-1-infected individuals experiencing immune 
dysfunction. 

SHIV-AD8 was selected as challenge virus for this study because 
several clinical features observed during infections of macaques were 
similar to those reported in HIV-1-infected individuals. SHIV-AD8 
consistently establishes sustained set-point viraemia in macaques 
inoculated by the intravenous or intrarectal routes and causes unrelenting 
depletion of CD4+ T lymphocytes. During the acute infection, 
SHIV-AD8 targets memory CD4+ T cells in blood and at effector sites 
in tissues. Their gradual depletion is subsequently followed by the loss 
of the naive CD4+ T lymphocyte subset. The latter heralds the onset of 
symptomatic immunodeficiency in macaques characterized by the 
development of opportunistic infections (Mycobacterium, Pneumocystis, 
Cryptosporidium species), lymphomas, marked weight loss, and death 
within 2-4 years of virus inoculation. In addition, SHIV-AD8-infected 
macaques generate cross-reactive antibodies, capable of neutralizing 
tier 1 and tier 2 HIV-1 isolates, including one ‘elite neutralizer’ macaque 
producing potent cross-clade neutralizing activity. As is frequently 
the case for HIV-1 elite neutralizers, resistant variants emerged in this 
SHIV-AD8-infected animal, which succumbed to AIDS at week 117 
after infection. 

The neutralization sensitivities of SHIV-AD8EO (ref. 17), a molecularly 
cloned derivative of SHIV-AD8 (ref. 16), to the 10-1074 (ref. 20), 
3BNC117 (ref. 21) and VRC01 (ref. 22) monoclonal antibodies were 
measured in the TZM-bl cell assay (Fig. 1a). The half-maximum 
inhibitory concentration (IC50) values determined for 10-1074, 3BNC117 
and VRC01 against SHIV-AD8EO were 0.20 µg ml⁻¹, 0.14 µg ml⁻¹ and 
0.63 µg ml⁻¹, respectively, indicating that the 10-1074 and 3BNC117 
monoclonal antibodies had similar activities and either one was more 
potent than VRC01 against SHIV-AD8EO in vitro. 
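
An IC50 of the kind quoted above is the antibody concentration at which neutralization reaches 50%. The sketch below, which is not the authors' analysis pipeline, estimates it by linear interpolation on a log10 concentration scale between the two titration points that bracket 50% neutralization. The function name and the titration values are hypothetical and serve only to show the arithmetic.

```python
import math


def ic50_log_interpolation(concentrations, percent_neutralization):
    """Estimate the IC50 by log-linear interpolation.

    concentrations: antibody concentrations (for example, in µg/ml), all > 0.
    percent_neutralization: percentage neutralization at each concentration.
    Returns None if no adjacent pair of points brackets 50%.
    """
    points = sorted(zip(concentrations, percent_neutralization))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo <= 50.0 <= y_hi and y_hi > y_lo:
            # Fraction of the way from y_lo to y_hi at which 50% is reached.
            frac = (50.0 - y_lo) / (y_hi - y_lo)
            log_ic50 = math.log10(c_lo) + frac * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    return None


# Hypothetical titration: tenfold dilution series and % neutralization.
example_concentrations = [0.01, 0.1, 1.0, 10.0]
example_neutralization = [10.0, 30.0, 70.0, 95.0]
example_ic50 = ic50_log_interpolation(
    example_concentrations, example_neutralization
)
```

In practice a sigmoidal dose-response curve is often fitted to all titration points instead; interpolation is simply the most transparent way to show how a lower IC50 corresponds to a more potent antibody.
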

A more critical test of neutralization efficacy is the prevention of 
virus acquisition in vivo after passive transfer of neutralizing antibodies 
and subsequent virus challenge. This was examined by administering 
the 10-1074 or 3BNC117 monoclonal antibodies 24 h before an 
intrarectal challenge of Indian origin rhesus macaques with 1,000 tissue 
infectious doses50 (TCID50) (approximately 3 animal infectious doses50 
(AID50)), an inoculum size we have previously determined is 
sufficient to establish SHIV-AD8EO infections in vivo after a single 
inoculation by this route in 10 of 10 macaques. As shown in Fig. 1b, the 
transfer of the 10-1074 monoclonal antibody at a dose of 20 mg kg⁻¹ or 
5 mg kg⁻¹ to macaques prevented virus acquisition in 2 of 2 and 2 of 2 
macaques, respectively. The administration of 1 mg kg⁻¹ of 10-1074, 
however, failed to protect either of two animals. For the 3BNC117 
passive transfer, the monoclonal antibody in vivo titration was initiated 
at a dose of 5 mg kg⁻¹, which blocked virus acquisition in 2 of 2 
macaques (Fig. 1c). In contrast, 2 of 2 macaques became infected when the 
dose of 3BNC117 was reduced to 1 mg kg⁻¹. The plasma 
concentrations at the time of challenge for both monoclonal antibodies were 
comparable (approximately 100 µg ml⁻¹) in the four macaques treated 
with a dose of 5 mg kg⁻¹. 

Recent studies have reported that combinations of three or more of 
the newly cloned broad and potent antibodies effectively suppressed 
HIV-1 viraemia in humanized mice. However, humanized mice 
carry a lower viral load, and do not have an intact adaptive or innate 
immune system. We have also previously reported that in the presence 
of neutralizing antibody, the clearance of circulating HIV-1 in vivo is 
accelerated from approximately 20 min to 3 or 4 min. The potential 
therapeutic benefit of anti-HIV-1 neutralizing antibodies controlling 
virus replication in infected macaques was evaluated by administering 
either 10-1074 or 3BNC117 alone (monotherapy) to two animals, 12 weeks 
after virus inoculation, when post-peak set-point viraemia levels had 
been established. On the basis of the pre-exposure prevention experiments 
Figure 1 | HIV monoclonal antibodies block SHIV acquisition. 
a, Neutralization of SHIV-AD8EO by three anti-HIV-1 monoclonal antibodies 
assayed in TZM-bl cells. b, Pre-exposure passive transfer of 10-1074 
monoclonal antibody to macaques followed by SHIV-AD8EO intrarectal 
challenge (20 mg kg⁻¹ (top), 5 mg kg⁻¹ (middle) and 1 mg kg⁻¹ (bottom)). 
c, Pre-exposure passive transfer of 3BNC117 monoclonal antibody to macaques 
followed by SHIV-AD8EO intrarectal challenge. Macaques ML1 and MAA 
received 20 mg kg⁻¹ of control anti-dengue virus NS1 IgG1 monoclonal antibody. 


described above, infected macaques were treated with either antibody 
at a dose of 10 mg kg⁻¹ (Table 1). Both recipients of the 3BNC117 
monoclonal antibody experienced rapid declines of plasma viraemia to 
background levels at day 10 after the start of treatment (Fig. 2a). Virus 
rebound became detectable on day 20 in these two macaques. The 
10-1074-treated animal MCN sustained an unexpectedly rapid and a 
greater than 10° reduction of plasma viral loads to undetectable levels 
by day 6 of treatment initiation. However, the effects of 10-1074 
monoclonal antibody administration were more variable and of short duration 
(4 days) (Fig. 2a and Extended Data Fig. 1). Furthermore, single 
genome amplification (SGA) analysis of virus in the four monotherapy 
recipients revealed that the rebound virus present in both of the 
10-1074-treated macaques had sustained changes that eliminated the 
gp120 Asn 332 glycan, rendering the virus resistant to this antibody 
(Extended Data Fig. 2). 

The known failure of antiretroviral drug monotherapy to durably 
control HIV-1 and SIV replication in vivo prompted us to co-administer 
3BNC117 and 10-1074 monoclonal antibodies, each at a dose of 
10 mg kg⁻¹, to chronically SHIV-AD8EO-infected macaques. In 
addition, the capacity of the molecularly cloned SHIV-AD8EO to cause 
unrelenting CD4+ T-cell depletion and symptomatic 
immunodeficiency provided the opportunity to assess the potential therapeutic 
effects of combination anti-HIV-1 neutralizing monoclonal antibodies 
in two groups of infected macaques experiencing the pathogenic effects 
of SHIV-AD8EO infection. The first group consisted of two clinically 
asymptomatic animals (DBZ3 and DC99A) that had been infected for 
159 weeks and had sustained similar and significant declines of 
circulating and bronchoalveolar lavage (BAL) CD4+ T cells (Table 1 and 
Extended Data Figs 3 and 4). At the time of monoclonal antibody 
administration, the plasma viral loads in macaques DBZ3 and DC99A 
were 1.08 × 10⁴ and 7.6 × 10° RNA copies ml⁻¹, respectively. Both 
macaques responded to combination anti-HIV-1 monoclonal antibody 
treatment with immediate and rapid reductions of plasma viraemia to 
undetectable levels within 7-10 days (Fig. 2b, c). Suppression of 
measurable SHIV-AD8EO in the plasma of macaques DBZ3 and DC99A, 
after a single administration of the two monoclonal antibodies, lasted 
18 and 36 days, respectively, exceeding the 4-7 day window of virus 
suppression observed during monotherapy. In each case, plasma viraemia 
rebounded to pre-treatment levels. A second cycle of combination 
immunotherapy was administered to macaques DBZ3 and DC99A 
to determine whether the re-emergence of detectable plasma viraemia 
simply reflected insufficient levels of circulating antibodies in these 
animals. As shown in the right panels of Figs 2b, c, the viral loads in 
each animal became undetectable by day 7 of the second treatment 
cycle. Viraemia was suppressed for 4 days in macaque DBZ3 and 
28 days in macaque DC99A. SGA analysis demonstrated that the 
rebound virus that emerged in macaque DC99A, after the second 
round of immunotherapy, carried a gp120 Asn332Asp mutation, 
indicating that resistance to the 10-1074 component of the antibody 
combination had occurred (Extended Data Fig. 5b). 

Combination monoclonal antibody therapy was also evaluated in a 
second group of three chronically infected animals (DBXE, DCF1 and 
DCM8), which had also been infected for more than 3 years but were 
clinically symptomatic, experiencing intermittent diarrhoea and/or 
anorexia (Table 1). At the time of monoclonal antibody 
administration, the level of circulating CD4+ T cells in one of these macaques 
(DCM8) was only 43 cells µl⁻¹ and was higher in animals DCF1 
(105 cells µl⁻¹) and DBXE (158 cells µl⁻¹). Plasma viral loads exceeded 
10° RNA copies ml⁻¹ in animals DBXE and DCF1 and were 
significantly lower (1.59 × 10° RNA copies ml⁻¹) in macaque DCM8. As 
shown in Fig. 2d-f and Extended Data Fig. 3, the administration of 


Table 1 | SHIV-AD8-infected macaques treated with anti-HIV monoclonal antibodies 

        Pre-infection   Treatment initiation  Pre-monoclonal antibody treatment                              Monoclonal antibody administered
        CD4+ T cells    (weeks after          CD4+ T cells    Viral load                                     3BNC117      10-1074
Animal  (cells µl⁻¹)    infection)            (cells µl⁻¹)    (RNA copies ml⁻¹)   Clinical status            (mg kg⁻¹)    (mg kg⁻¹)
KZ6     881             12                    1,447           5.22 × 10°          Post-acute set point       10           0
MB6     545             12                    1,598           7.43 × 10°          Post-acute set point       10           0
MB8     1,512           12                    2,625           1.14 × 10⁴          Post-acute set point       0            10
MCN     210             12                    599             2.14 × 10⁴          Post-acute set point       0            10
DBZ3    650             159                   118             1.08 × 10⁴          Asymptomatic               10           10
DC99A   623             159                   165             7.60 × 10°          Asymptomatic               10           10
DBXE    1,585           163                   158             1.96 × 10°          Intermittent diarrhoea     10           10
DCF1    1,203           157                   105             1.44 × 10°          Intermittent diarrhoea     10           10
DCM8    608             163                   43              1.59 × 108          Intermittent diarrhoea     10           10
Figure 2 | Suppression of plasma viraemia after monotherapy or 
combination anti-HIV-1 neutralizing antibody treatment. a, Plasma viral 
loads in post-acute set-point SHIV-infected rhesus macaques with/without 
single monoclonal antibody treatment. b-f, Plasma viral loads and total CD4+ 
T-cell numbers in chronically SHIV-infected rhesus macaques (DBZ3 
(b), DC99A (c), DBXE (d), DCF1 (e) and DCM8 (f)) after combination 
monoclonal antibody treatment. 


the two monoclonal antibodies to macaque DBXE resulted in 
reduction of viraemia from 2.0 × 10° RNA copies at day 0 to undetectable 
levels in plasma at day 20. This was followed, within a few days, by a 
resurgence of high levels of circulating virus in DBXE. Macaque 
DCM8, with more modest plasma virus loads and very low numbers 
of circulating CD4+ T cells, experienced a rapid decline of viraemia 
to undetectable levels between days 6 and 17 after the initiation of 
monoclonal antibody treatment. Finally, animal DCF1, previously 
reported to have generated broadly reacting anti-HIV-1 neutralizing 
antibodies, exhibited a transient and a comparatively modest 27-fold 
reduction of plasma viraemia by day 6 in response to combination 
monoclonal antibody therapy, before the viral loads returned to high 
pre-treatment levels. 

SGA analysis of emerging virus populations in the five long-term 
chronically infected macaques revealed no changes affecting sensitivity 
to either 10-1074 or 3BNC117 except for the rebounding virus after the 
second treatment cycle of macaque DC99A noted above (Extended 
Data Fig. 5). 

Because some of the recipients of combination antibody treatment 
had prolonged control of detectable plasma viraemia, peripheral blood 
mononuclear cell (PBMC)-associated viral RNA and DNA levels were 
evaluated before and after monoclonal antibody administration. As 
shown in Extended Data Table 1, monoclonal antibody treatment 
resulted in reduced levels of cell-associated viral RNA in each animal 
contemporaneous with the suppression of its plasma viraemia. No 
consistent change in cell-associated viral DNA levels occurred as a 
result of antibody treatment. 

Co-administration of neutralizing monoclonal antibodies to 
chronically SHIV-AD8EO-infected macaques resulted in a transient 
elevation of circulating CD4+ T-cell levels, particularly in late-stage animals 
with very high viral loads. The CD4+ T-cell numbers in macaques 
DBXE and DCF1 increased 2-3-fold during the period of 
monoclonal-antibody-mediated virus suppression, but gradually declined to 
pre-treatment levels as viraemia again became detectable (Extended Data 
Fig. 6). The corresponding elevations and declines of both the naive 
and memory CD4+ T-cell subsets in animals DBXE and DCF1 would 
be consistent with an antibody-mediated tissue redistribution 
mechanism, given that the R5 tropic SHIV-AD8EO exclusively targets 
memory, not naive, CD4+ T lymphocytes. The levels of memory 
CD4+ T cells at an effector site (from BAL) did not change appreciably 
during immunotherapy (Extended Data Fig. 4). 

Figure 3 | Plasma viraemia rebounds in SHIV-infected macaques when 
neutralizing antibody levels decline. a-k, Viral RNA levels and 
concentrations of 10-1074 or 3BNC117 monoclonal antibodies in plasma at 
various times after initiation of single or combination antibody treatment. In 
panels c, d, h, the gp120 changes present in the rebound virus populations and 
their frequencies are indicated at the bottom. 

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Plasma concentrations of each monoclonal antibody were deter- 
mined by measuring the neutralizing activity in plasma against selected 
HIV-1 pseudovirus strains sensitive to one or the other, but not to both, 
antibodies (Extended Data Fig. 7). In treated animals, suppression of 
SHIV-AD8EO viraemia was maintained until a threshold plasma 
monoclonal antibody concentration of approximately 5 µg ml−1 was
reached, except when escape variants emerged in animals MB8, MCN 
and the second cycle of treatment in macaque DC99A (Fig. 3 and 
Extended Data Fig. 8). This was even the case for macaque DCF1, for 
which only a modest and transient reduction of plasma viral RNA levels 
was observed after combination therapy. Notably, the monoclonal 
antibodies administered to this clinically symptomatic macaque had a 
shortened half-life or were undetectable (Fig. 3j). In this regard, macaque 
DCF1 had to be euthanized on day 56 after treatment initiation and a 
necropsy revealed severe enteropathy, characterized by disseminated 
gastrointestinal cryptosporidiosis, pancreatitis and cholangitis, perhaps 
explaining the rapid loss of neutralizing antibodies in this animal. 

The decay rate constants and the corresponding half-lives of plasma 
viral RNA after monotherapy or combination neutralizing antibody 
treatments were calculated from the initial slopes of the HIV-1 plasma 
RNA decline (Extended Data Table 2). Furthermore, unlike pharmacological
antiretroviral therapy (ART), in which two phases of viral decay
lasting several weeks are commonly seen, treatment with neutralizing
antibodies induced a single phase of exponential decline (t1/2 = 1.13 days).
For the macaques (DC99A and DBZ3) that received two cycles of 
treatment, the decay rate constant for the second treatment was higher 
than that for the first treatment for one of the animals and lower for the 
other (Extended Data Table 2). 

Our findings demonstrate that combination anti-HIV neutralizing 
monoclonal antibody treatment is superior to monotherapy and can 
rapidly and potently suppress plasma viraemia in chronically SHIV-infected
macaques with low CD4+ T-cell levels and symptomatic disease.
Animals administered a single neutralizing monoclonal antibody experi- 
enced a relatively short period of plasma viral RNA suppression, and 
neutralization-resistant progeny virus emerged in some macaques, 
attesting to the robustness of SHIV-AD8EO in vivo. Our results further 
indicate that administration of monoclonal antibodies, alone or in 
combination with currently available antiretroviral drug regimens,
could be useful therapeutically, particularly for chronically infected 
individuals with compromised immune systems. 


METHODS SUMMARY 


The neutralization activities of the broadly acting 3BNC117 (ref. 21) CD4-binding
site and 10-1074 (ref. 20) gp120 Asn-332-dependent neutralizing monoclonal
antibodies against SHIV-AD8EO were initially determined in the TZM-bl cell
system as previously described. Their capacities to block
virus acquisition or to control plasma viraemia in chronically infected animals
challenged with the R5-tropic SHIV-AD8EO were assessed by monitoring plasma
viral loads and cell-associated viral nucleic acids; levels of CD4+ T-cell subsets
were measured by flow cytometry as previously reported. SGA analyses of circulating
viral variants and the determination of antibody levels in plasma have
been previously described. Plasma concentration of neutralizing antibodies was
determined by measuring neutralizing activity against HIV-1 pseudovirus preparations
susceptible to only one of 10-1074 or 3BNC117.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Virus. The origin and preparation of the tissue-culture-derived SHIV-AD8EO 
stock have been previously described.

Animal experiments. Twenty-three male and female rhesus macaques (Macaca 
mulatta) of Indian genetic origin ranging from 2 to 10 years of age were maintained 
in accordance with Guide for the Care and Use of Laboratory Animals Report no. 
NIH 85-23 (Department of Health and Human Services, Bethesda, Maryland, 
1985) and were housed in a biosafety level 2 NIAID facility. Phlebotomies, euthanasia 
and sample collection were performed as previously described. All animals were
negative for the MHC class I Mamu-A*01, Mamu-B*08 and Mamu-B*17 alleles.

Antibodies. Two monoclonal antibodies, 3BNC117 (ref. 21) and 10-1074 (ref. 20),
were isolated and produced as described elsewhere. DEN3, a dengue virus NS1-specific
human IgG1 antibody, was used as the negative control antibody in this
study. All antibodies were administered intravenously.

Quantification of viral nucleic acids. Viral RNA levels in plasma were determined
by real-time reverse transcription-PCR (ABI Prism 7900HT sequence detection
system; Applied Biosystems) as previously reported. Ultrasensitive measurements
of cell-associated SIV RNA and DNA were made by a nested, hybrid real-time/digital
PCR assay.

Neutralization assays. The in vitro potency of each monoclonal antibody was 
assessed by TZM-bl entry assay with pseudotyped SHIV-AD8EO as previously 
reported.

Lymphocyte immunophenotyping. EDTA-treated blood samples were stained 
for flow cytometric analysis as previously described.

Antibody concentrations and t1/2 estimations in plasma. Plasma concentrations
of 10-1074 and 3BNC117 neutralization activity were separately determined
against HIV-1 virus strains that are sensitive to one but not the other monoclonal
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antibody as well as not, or very weakly, sensitive to autologous antibodies in 
macaque plasma. All samples were heat-inactivated for 1 h at 56 °C and neutralizing
activity was measured by using a TZM-bl assay as previously described. The
half-maximum inhibitory dose (ID50) titres were determined by serial plasma
dilutions starting at 1:20; antibody concentrations were calculated by multiplying
each ID50 value. t1/2 of plasma monoclonal antibodies was estimated by ln(2)/k.

Single genomic analysis. Single-genome amplification of full-length HIV-1 env
genes was performed as reported.

Determination of plasma viral RNA decay. The kinetics of plasma HIV-1 RNA
after the start of therapy was analysed as previously described by a simple one-exponential
model: ln(V) = c − kt, where V is the HIV-1 RNA concentration,
k is the virus decay rate constant (note that here k corresponds to k1 (ref. 30)) and c is a
constant close to the ln of the baseline virus concentration. The half-life t1/2 was
calculated as 0.693/k. The data were fitted using the program Excel and the goodness
of fit was estimated by the correlation coefficients, which were in the range
from 0.91 to 1.0. All time points in the exponential phase of decline were used for
calculation.
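The one-exponential fit described in this section can be illustrated with a short Python sketch. It is not the authors' Excel workflow: the function name `fit_viral_decay` and the noise-free example time course are hypothetical and serve only to show how k, c and t1/2 follow from a least-squares line through ln(V) against t.

```python
import math


def fit_viral_decay(days, copies_per_ml):
    """Fit ln(V) = c - k*t by ordinary least squares.

    Returns (k, c, half_life, r), where r is the absolute value of the
    Pearson correlation coefficient between t and ln(V).
    """
    if len(days) != len(copies_per_ml) or len(days) < 2:
        raise ValueError("need at least two paired time points")
    log_v = [math.log(v) for v in copies_per_ml]  # natural log of viral load
    n = len(days)
    mean_t = sum(days) / n
    mean_y = sum(log_v) / n
    sxx = sum((t - mean_t) ** 2 for t in days)
    syy = sum((y - mean_y) ** 2 for y in log_v)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(days, log_v))
    slope = sxy / sxx
    k = -slope                      # decay rate constant (per day)
    if k <= 0:
        raise ValueError("viral load does not decline over these points")
    c = mean_y - slope * mean_t     # intercept, close to ln(baseline load)
    half_life = math.log(2) / k     # equivalent to 0.693/k
    r = abs(sxy) / math.sqrt(sxx * syy)
    return k, c, half_life, r


# Hypothetical, noise-free time course: V = 1e5 * exp(-0.6 * t)
example_days = [0, 1, 2, 3, 4]
example_loads = [1e5 * math.exp(-0.6 * t) for t in example_days]
k, c, half_life, r = fit_viral_decay(example_days, example_loads)
print(f"k = {k:.3f} per day, t1/2 = {half_life:.2f} days, r = {r:.2f}")
```

Only time points from the exponential phase of decline should be passed in, mirroring the restriction stated above.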
Extended Data Figure 1 | Treatment of SHIV-infected macaques with single
anti-HIV-1 neutralizing monoclonal antibodies. Plasma viral loads and total
CD4+ T cell numbers before (the initial 84 days of the SHIV-AD8EO infection)
and during single monoclonal antibody treatment are shown. KZ6 and MB6
received the 3BNC117 monoclonal antibody and MB8 and MCN were
administered the 10-1074 monoclonal antibody. Macaques MB7 and MD5
were not treated.
Extended Data Figure 2 | SGA analysis of selected SHIV-AD8EO gp120
sequences, present in rebound virus after single monoclonal antibody
immunotherapy, and known to confer resistance to 10-1074 or 3BNC117
monoclonal antibody. SGA was used to amplify plasma viral RNA after [...]
(n = 8)), MB6 (day 23 (n = 8)), MB8 (day 23 (n = 9)) and MCN (day 23
(n = 8)). The gp120 sequences at the top are present in the SHIV-AD8EO
molecular clone inoculum. Mutations conferring resistance are highlighted in
red.
Extended Data Figure 3 | Circulating CD4+ T cells in five chronically
SHIV-infected macaques treated with two anti-HIV-1 neutralizing
Extended Data Figure 5 | SGA analysis of selected SHIV-AD8EO gp120
sequences known to confer resistance to 10-1074 or 3BNC117 monoclonal
antibody, before and after combination immunotherapy. a-e, Plasmas from
animals DBZ3 (pre (n = 8); day 49 post first treatment (n = 10); day 24 post
second treatment (n = 8)) (a), DC99A (pre (n = 10); day 57 post first treatment
(n = 6); day 41 post second treatment (n = 7)) (b), DBXE (pre (n = 9), day 28
(n = 8)) (c), DCF1 (pre (n = 14), day 28 (n = 10)) (d) and DCM8 (pre (n = 7),
day 28 (n = 11)) (e) were evaluated. The gp120 sequences at the top are present
in the SHIV-AD8EO molecular clone inoculum. Mutations conferring
resistance are highlighted in red.
Extended Data Figure 6 | CD4+ T-cell numbers increase during
combination monoclonal antibody treatment of SHIV-AD8EO-infected
macaques. a, b, Levels of viral RNA and total CD4+ T-cell/CD4+ T-cell subsets
in symptomatic chronically infected macaques DBXE (a) and DCF1 (b).
Extended Data Figure 7 | Assays to identify 10-1074- or 3BNC117-specific
neutralizing activities in the plasma of monoclonal-antibody-treated
macaques. a, ID50 values measured in the TZM-bl neutralization assay of 10-1074
and 3BNC117 against HIV-1 strains that are sensitive to one but not the
other broadly neutralizing antibody (that is, HIV-1 strain X2088_9 (10-1074
sensitive); HIV-1 strain Q769_d22 (3BNC117 sensitive)). b, Neutralizing
activities in plasma before antibody administration (preP), but spiked with
0.01, 0.1, 1, 10 and 100 µg ml−1 of antibodies 10-1074 (blue) or 3BNC117
(green). Neutralizing activities are reported as plasma ID50 titres (left columns)
and converted to antibody concentrations (right columns) based on measured
ID50 values in a.
Extended Data Figure 8 | Monoclonal antibody levels in the plasmas of
monotherapy and combination monoclonal antibody macaque recipients.
a-c, Macaques treated with one neutralizing monoclonal antibody (a);
macaques receiving two cycles of combination monoclonal antibody
treatment (b); macaques receiving a single cycle of combination monoclonal
antibody treatment (c). ID50 titres (left columns) and monoclonal antibody
concentrations (right columns) were measured in the indicated macaque
plasma samples before (Prebleed) and after (Day) monoclonal antibody
administration.
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Extended Data Table 1 | Cell-associated viral RNA/DNA in rhesus macaques receiving combination monoclonal antibody therapy 
Animal  Treatment  Time (days)  Plasma viral RNA (copies/ml)  SIV Gag RNA  SIV Gag DNA
DBZ3    1st        0            1.08E+04                      9,000        6,700
DBZ3    1st        10           <100                          360          7,500
DBZ3    1st        20           <100                          2,600        14,000
DBZ3    1st        24           <100                          1,600        6,400
DBZ3    1st        27           <100                          670          5,700
DBZ3    2nd        0            1.32E+04                      52,000       15,000
DBZ3    2nd        10           <100                          380          4,700
DBZ3    2nd        13           4.09E+02                      1,000        11,000
DC99A   1st        0            7.60E+03                      31,000       1,400
DC99A   1st        14           <100                          18,000       5,600
DC99A   1st        20           <100                          8,100        2,700
DC99A   1st        27           <100                          400          790
DC99A   1st        34           <100                          550          1,100
DC99A   1st        M            <100                          7,200        780
DC99A   2nd        0            2.35E+03                      23,000       2,100
DC99A   2nd        21           <100                          570          1,100
DC99A   2nd        27           <100                          1,100        2,100
DBXE    -          0            1.96E+05                      470,000      71,000
DBXE    -          14           3.59E+02                      17,000       33,000
DBXE    -          17           7.55E+02                      11,000       22,000
DBXE    -          20           <100                          11,000       33,000
DBXE    -          23           1.60E+03                      17,000       27,000
DCM8    -          0            1.59E+03                      110,000      8,600
DCM8    -          14           <100                          1,700        1,600
DCM8    -          17           <100                          880          5,000
DCM8    -          20           1.51E+02                      22,000       6,600
DCF1    -          0            1.44E+05                      240,000      15,000
DCF1    -          6            5.38E+03                      34,000       5,500
DCF1    -          14           1.69E+05                      190,000      11,000
DCF1    -          20           1.45E+05                      1,100,000    14,000
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Extended Data Table 2 | Decay rate constants of SHIV-AD8 RNA in plasma after monoclonal antibody treatment 


DC99A 1st
DC99A 2nd
DBZ3 1st
DBZ3 2nd
DBXE
DCF1
DCM8
KZ6
MB6
MB8
MCN


Rate constants of initial viral RNA decline in plasma (decay rate constants k) and corresponding half-lives (t1/2) were determined as previously described (ref. 26). The decay rate constants k were calculated by 
fitting the data with an exponential function. The fit was very good with correlation coefficients in the range from 0.91 to 1.0. 
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Cell migration requires the generation of branched actin networks 
that power the protrusion of the plasma membrane in lamellipodia.
The actin-related proteins 2 and 3 (Arp2/3) complex is the molecular
machine that nucleates these branched actin networks. This machine
is activated at the leading edge of migrating cells by Wiskott-Aldrich
syndrome protein (WASP)-family verprolin-homologous protein
(WAVE, also known as SCAR). The WAVE complex is itself directly
activated by the small GTPase Rac, which induces lamellipodia.
However, how cells regulate the directionality of migration is poorly 
understood. Here we identify a new protein, Arpin, that inhibits the 
Arp2/3 complex in vitro, and show that Rac signalling recruits and 
activates Arpin at the lamellipodial tip, like WAVE. Consistently, 
after depletion of the inhibitory Arpin, lamellipodia protrude faster 
and cells migrate faster. A major role of this inhibitory circuit, how- 
ever, is to control directional persistence of migration. Indeed, Arpin 
depletion in both mammalian cells and Dictyostelium discoideum 
amoeba resulted in straighter trajectories, whereas Arpin micro- 
injection in fish keratocytes, one of the most persistent systems of 
cell migration, induced these cells to turn. The coexistence of the 
Rac-Arpin-Arp2/3 inhibitory circuit with the Rac-WAVE-Arp2/3 
activatory circuit can account for this conserved role of Arpin in 
steering cell migration. 

The Arp2/3 complex is activated at different cellular locations by
different nucleation promoting factors (NPFs): WAVE at lamellipodia,
neural WASP (N-WASP) at clathrin-coated pits, and WASP and
SCAR homologue (WASH) at endosomes. NPFs share a characteristic
carboxy-terminal tripartite domain, referred to as the VCA. The A
(acidic) motif binds to the Arp2/3 complex and induces its conformational
activation. Arp2/3 inhibitory proteins containing an A motif,
PICK1 and Gadkin (also known as AP1AR and γ-BAR), were detected
at endocytic pits and at endosomes. Thus, although endocytic pits
and endosomes possess antagonistic activities towards the Arp2/3 com- 
plex, it was not known whether lamellipodia contain a similar Arp2/3 
inhibitory protein to counteract WAVE. To identify such a protein, 
we performed a bioinformatics search for proteins displaying the typ- 
ical A motif of human NPFs, characterized by a tryptophan residue at 
the antepenultimate position in an acidic context. We identified an 
uncharacterized protein (C15orf38) clustered with NPFs with this 
procedure (Fig. 1a). This protein was named Arpin. Arpin is encoded 
by a single gene in metazoans and amoeba. Predictions and NMR 
analysis indicate that this protein of about 220 residues is structured
with the exception of its highly mobile C-terminal end, which contains
the putative Arp2/3-binding site (Extended Data Figs 1 and 2). Indeed, 
Arpin binds to Arp2/3 mostly through its acidic motif (Fig. 1b and 
Extended Data Figs 3 and 4). 
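The search criterion stated above, a tryptophan at the antepenultimate position in an acidic context, can be sketched as a simple sequence filter. This is an illustration only, not the bioinformatics procedure the authors used: the function name, the 10-residue window and the threshold of four acidic residues are assumptions, and the sequences are toy strings rather than real proteins.

```python
ACIDIC = frozenset("DE")  # aspartate and glutamate, one-letter codes


def has_acidic_w_motif(sequence, window=10, min_acidic=4):
    """Return True if Trp is antepenultimate and the C-terminal tail is acidic.

    `window` and `min_acidic` are illustrative thresholds, not published values.
    """
    seq = sequence.strip().upper()
    if len(seq) < max(window, 3):
        return False
    if seq[-3] != "W":          # antepenultimate residue must be tryptophan
        return False
    tail = seq[-window:]        # C-terminal stretch used to judge acidity
    return sum(res in ACIDIC for res in tail) >= min_acidic


# Toy sequences invented for illustration (not real proteins)
toy_sequences = {
    "acidic_tail_with_W": "MSTAKLGSEEDDDEWDD",
    "acidic_tail_no_W": "MSTAKLGSEEDDDEADD",
    "basic_tail_with_W": "MSTAKLGSKKRRLLWGG",
}
hits = [name for name, seq in toy_sequences.items() if has_acidic_w_motif(seq)]
print(hits)
```

A real search would run such a filter over a proteome and then cluster the hits with known NPFs, as described for Fig. 1a.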

The molecular function of Arpin on Arp2/3 activity was assayed by
spectrofluorimetry and total internal reflection fluorescence (TIRF)
microscopy using purified proteins. Arpin was unable to activate the
Arp2/3 complex, consistent with its lack of verprolin (V) and cofilin
(C) homology motifs. However, when Arp2/3 was activated by VCA,
Arpin, but not its truncated form lacking the acidic motif (ArpinΔA),
inhibited actin polymerization in a dose-dependent manner (Fig. 1c).
The acidic peptide was sufficient for this inhibition, although it was less
effective than full-length Arpin, in line with its lower affinity for the
Arp2/3 complex (Extended Data Fig. 4). Arpin inhibited Arp2/3 activation,
because we observed by TIRF microscopy the generation of fewer
branched actin junctions in the presence of Arpin (Fig. 1d and Supplementary
Video 1). Full-length Arpin and its acidic motif, but not
ArpinΔA, compete with VCA for Arp2/3 binding (Fig. 1e and Extended
Data Fig. 4). Therefore, Arpin is a new competitive inhibitor of the
Arp2/3 complex. The name Arpin is a mnemonic for its activity (Arp2/3
inhibition).

The subcellular localization of Arpin was examined by immunofluorescence
in spreading mouse embryonic fibroblasts (MEFs). Arpin
was detected in restricted segments of the plasma membrane, which
were also stained by three lamellipodial markers: the WAVE complex,
the Arp2/3 complex and cortactin (Fig. 2a and Extended Data Fig. 5).
Radial line scans of immunofluorescence pictures through lamellipodial
outlines were generated, registered with the edge as a reference, and
averaged to reveal the relative distributions of these different lamellipodial
components. Arpin overlapped perfectly with the distribution
of the WAVE complex, a tip component (Fig. 2a). Arpin is thus localized
at the lamellipodium tip, where new actin branches are nucleated
by WAVE and Arp2/3 complexes. As expected, Arp2/3 and cortactin
distribution extended rearwards compared to Arpin (Extended Data
Fig. 5), because Arp2/3 and cortactin correspond to branches of the
lamellipodial actin network undergoing retrograde flow with respect
to the protruding membrane.
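The registration and averaging of line scans described in this paragraph can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' ImageJ plug-in or Excel macro: the function name is hypothetical, each scan is assumed to be a list of intensities with a known edge index, and the numbers are toy values.

```python
def average_registered_scans(scans, edge_indices):
    """Align intensity profiles on their edge position and average them.

    scans: list of lists of intensities sampled along each radial line.
    edge_indices: index of the cell edge within each scan.
    Returns a sorted list of (offset_from_edge, mean_intensity, n_scans).
    """
    if len(scans) != len(edge_indices):
        raise ValueError("one edge index is required per scan")
    sums, counts = {}, {}
    for profile, edge in zip(scans, edge_indices):
        for i, value in enumerate(profile):
            offset = i - edge   # 0 marks the edge in every registered scan
            sums[offset] = sums.get(offset, 0.0) + value
            counts[offset] = counts.get(offset, 0) + 1
    return [(o, sums[o] / counts[o], counts[o]) for o in sorted(sums)]


# Toy data: the same peak sits at a different index in each scan
toy_scans = [[0, 1, 5, 2], [1, 5, 2, 0]]
toy_edges = [2, 1]
profile = average_registered_scans(toy_scans, toy_edges)
for offset, mean_intensity, n in profile:
    print(offset, mean_intensity, n)
```

Registering on the edge before averaging is what lets peaks from different cells superimpose; averaging unregistered scans would blur a tip-localized signal.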

To understand the regulation of Arpin activity, we examined the 
role of Rac, the master controller of lamellipodium formation. We co- 
transfected 293T cells with different forms of Rac and green fluor- 
escent protein (GFP)-Arpin and then analysed the interaction of 
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Figure 1 | Arpin inhibits Arp2/3 activation in vitro. a, Alignment of acidic C 
termini of three NPFs and of Arpin. b, Arpin binds to the Arp2/3 complex 
through its acidic C-terminal region. Glutathione S-transferase (GST) 
pulldown with full-length Arpin (FL), its last 16 amino acids (A), ArpinΔA or
the VCA domain of N-WASP as a positive control. ArpC2 is a subunit of the 
Arp2/3 complex. c, Spectrofluorimetry assay using pyrene-labelled actin to 
monitor polymerization. a.u., arbitrary units. d, Assembly of branched actin 
networks monitored by TIRF microscopy using rhodamine-labelled actin. Scale
bar, 5 µm. e, Full-length Arpin, but not ArpinΔA (18 µM and serial twofold
dilutions), competes with the NPF for Arp2/3 binding.


Arpin with the Arp2/3 complex through GFP immunoprecipitations.
The active form of Rac1, which is sufficient to induce lamellipodia, was
also sufficient to induce Arp2/3 co-immunoprecipitation with Arpin
(Fig. 2b). To examine whether Rac is required for Arpin activation, we
used Rac1 knockout MEFs that lack lamellipodia. The absence of Rac
abrogated the peripheral localization of Arpin in all knockout MEF
cells examined (Extended Data Fig. 5). Endogenous Arpin was then
immunoprecipitated from Rac1 knockout MEFs. The Arp2/3 complex
co-immunoprecipitated with Arpin in control MEF cells, but not in
Rac-deficient cells (Fig. 2c). Together, these results show that, in response
to Rac signalling, Arpin inhibits the Arp2/3 complex at the
lamellipodium tip, that is, where Rac also stimulates actin polymerization
through the WAVE complex.

This counterintuitive finding suggests that Arpin would be a built-in
brake on protrusions. Human RPE1 cells transiently transfected with
short hairpin RNAs (shRNA) that efficiently deplete Arpin displayed
increased lamellipodia-mediated cell spreading (Extended Data Fig. 6
and Supplementary Video 2). Arpin depletion increased protrusion
velocity of lamellipodia, consistent with its Arp2/3 inhibitory role
(Fig. 2d). This effect was fully rescued by Arpin re-expression, but only
when Arpin contained the acidic motif. Arpin thus provides a paradoxical
negative circuit downstream of Rac. Such a circuitry, where
Rac induces and inhibits actin polymerization, generates an ‘incoherent
feedforward loop’ (Fig. 2e), which can favour temporal regulations.

To examine whether the Arpin circuit is physiologically relevant for 
cell migration, we impaired the expression of the arpin gene in zebra- 
fish embryos using morpholinos. During gastrulation, prechordal plate 


282 | NATURE | VOL 503 | 14 NOVEMBER 2013 


[Figure 2 image. Recoverable panel labels: a, fluorescence intensity versus distance (µm); b, c, GFP-Arpin, PC-Rac (TN, WT, QL), Arpin, input; d, shRNA Ctrl and Arpin, GFP; e, Rac, WAVE complex, Arpin, Arp2/3, branched actin.]


Figure 2 | Arpin inhibits the Arp2/3 complex at the lamellipodium tip.
a, Arpin colocalizes with Brk1, a subunit of the WAVE complex, at the
lamellipodium tip (mean ± s.e.m., 24 line scans). Scale bar, 20 µm. b, Active
(Gln61Leu; QL), but not wild-type (WT), nor dominant negative (Thr17Asn;
TN), Rac induces Arpin association with the Arp2/3 complex. IP,
immunoprecipitation. PC, protein C epitope. c, Rac is required for the Arpin-Arp2/3
interaction. KO, knockout. d, Arpin depletion increases the speed at
which lamellipodia protrude. GFP-Arpin expression rescues the phenotype.
Ctrl, control; shCtrl, control shRNA; shArpin, arpin shRNA. Data are
mean ± s.e.m.; n = 40; ***P < 0.001, two-tailed analysis of variance
(ANOVA). e, Arpin circuitry.


cells collectively migrate towards the animal pole. After arpin loss of 
function, cell movements were less coordinated (Extended Data Fig. 7). 
Prechordal plate cell transplants revealed a cell autonomous effect of 
Arpin on protrusions. Protrusions were more frequent and more per- 
sistent over time in the absence of Arpin (Supplementary Video 3). 
This observation is consistent with the incoherent feedforward loop, a 
circuitry that can suppress the protrusion it creates. 

To understand the role of the incoherent feedforward loop in cell 
migration further, we performed Arpin loss-of-function experiments 
in cell systems migrating as individual cells. Migration of stable Arpin-depleted
clones from the human breast cell line MDA-MB-231 was
analysed by video microscopy in two or three dimensions (Fig. 3a, 
Supplementary Videos 4 and 5, and Extended Data Fig. 8). In both 
cases, the tracks illustrate that Arpin-depleted cells explored a larger 
territory than control cells, an observation substantiated by mean 
square displacements (Extended Data Fig. 9). Increased exploration 
was not only due to increased speed, but also to increased directional 
persistence, measured as the ratio of the straight-line distance between two points
to the length of the actual trajectory (Fig. 3a) or as the direction autocorrelation
function (Extended Data Fig. 10). Because Arpin is conserved in amoeba, 
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Figure 3 | Arpin depletion increases directional persistence of migration in 
mammalian cells and in Dictyostelium discoideum. a, Arpin-depleted 
(shArpin1 and shArpin2) MDA-MB-231 cells explore a wider territory than 
controls owing to increased speed and directional persistence. The 
directionality index is calculated as the ratio of the distance between starting
and ending points (d) to the length of the actual trajectory (D). Mean ± s.e.m.; n = 24, 31, 32
and 26, respectively; *P < 0.05, two-tailed Kruskal-Wallis test. b, Arpin-knockout
amoeba explores a wider territory than wild type owing to increased
speed and directional persistence. Directionality is more than fully rescued by
Dictyostelium Arpin expression in knockout amoeba. Data are mean ± s.e.m.;
n = 116, 196 and 45, respectively; *P < 0.05, two-tailed Kruskal-Wallis test.


we analysed its function in Dictyostelium discoideum by knocking out the 
orthologous gene. As in mammalian cells, Arpin-knockout Dictyostelium 
amoebae explored a wider territory than the controls owing to increased 
cell speed and directional persistence (Fig. 3b and Supplementary Video 6). 
However, directional persistence, which is more than fully rescued by 
GFP-Arpin expression, is the most affected parameter in amoeba. 
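
The directionality index used in Fig. 3, the net displacement d divided by the path length D, can be computed from tracked positions as in the sketch below. The function name and the toy tracks are hypothetical; real tracks would come from a tracking tool such as the MtrackJ plug-in named in the Methods.

```python
import math


def directionality_index(track):
    """Return d/D for a track given as a list of (x, y) positions.

    d is the straight-line distance between the first and last positions;
    D is the summed length of all steps along the trajectory.
    """
    if len(track) < 2:
        raise ValueError("a track needs at least two positions")
    path_length = sum(math.dist(p, q) for p, q in zip(track, track[1:]))  # D
    if path_length == 0:
        raise ValueError("the cell did not move")
    net_distance = math.dist(track[0], track[-1])  # d
    return net_distance / path_length


# Toy tracks: a perfectly straight path and one with a right-angle turn
straight = [(0, 0), (1, 0), (2, 0)]
turning = [(0, 0), (3, 0), (3, 4)]
print(directionality_index(straight), directionality_index(turning))
```

The index is 1 for a perfectly straight path and approaches 0 for a path that returns to its origin, so straighter trajectories after Arpin depletion appear as higher values.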

Arpin could thus be a ‘steering factor’. To test this hypothesis directly
in a gain-of-function experiment, we selected the fish keratocyte model.
These cells are characterized by fast migration based on a wide fan-shaped
lamellipodium with high directional persistence. We purified
zebrafish Arpin and ArpinΔA from Escherichia coli and microinjected
these proteins into migrating trout keratocytes (Fig. 4a). Injection of
Arpin, but not ArpinΔA, caused keratocytes to reduce their speed and
deviate from their initial direction of migration (Fig. 4b and Extended 
Data Fig. 10). Notably, Arpin did not prevent lamellipodia protrusion, 
but resulted in cycles of suppression of existing lamellipodia followed 
by formation of new, ectopic ones (Fig. 4c and Supplementary Video 7). 
Collectively, the experiments performed in different systems of cell 
migration support a role for Arpin in promoting cell steering: Arpin 
slows cells down and allows them to turn. 

In a computational model of efficient and persistent cell migration, 
the lamellipodium spatially determines where the WAVE and the 
Arp2/3 complexes will next polymerize actin, thus maintaining the 
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Figure 4 | Arpin microinjection induces fish keratocyte to turn. a, Gallery of 
fish keratocytes microinjected with purified full-length Arpin, ArpinΔA or
buffer as control. Scale bar, 20 µm. b, Quantifications. Data are mean ± s.e.m.;
n = 16, 15 and 11, respectively; *P < 0.05, ***P < 0.001, two-tailed ANOVA.
c, Kymograph of the Arpin-microinjected keratocyte. d, Model. 


front at the front over time (Fig. 4d). Such a feedback loop, sensing
where branched actin is polymerized and activating Rac as a response,
has recently been identified: it involves coronin 1A and the Rac exchange
factor β-Pix. In this feedback system, the WAVE complex closes a
positive feedback loop that maintains efficient directional migration
over time, whereas Arpin closes a concurrent negative feedback loop,
which induces braking and allows turning. These two nested feedback
loops, positive and negative, can account for the emergence of oscillations
in lamellipodium protrusion/retraction, as observed in fish keratocytes
after Arpin injection (Supplementary Video 8), and for various
travelling actin waves described in different systems.

Other proteins were previously shown to regulate cell steering.
Knockdown of Rac1 or cofilin, a protein that severs and depolymerizes
actin filaments, increases directional persistence of mammalian cells.
These proteins are required, however, for lamellipodial protrusion and
actin-based motility. Arpin is unique in that it regulates cell steering
while being dispensable for lamellipodial protrusion and efficient
migration. Arpin is a prime candidate to fine-tune numerous physiological
migrations biased by diverse cues. Arpin also seems to have a
role in preventing cells from migrating. In this respect, dissection of the 
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mechanisms regulating Arpin in physiology and pathology is a major 
challenge ahead of us. 


METHODS SUMMARY 

Arpin purification and antibody production. Arpin was purified from E. coli as 
a glutathione S-transferase (GST) fusion protein and cleaved off GST. Untagged 
Arpin was used for in vitro assays of actin polymerization, antibody generation 
and keratocyte injection. Rabbit polyclonal antibodies targeting full-length Arpin 
were purified by affinity chromatography. Immunoprecipitation of endogenous 
Arpin was performed using purified antibodies coupled to magnetic beads. 

Cells and imaging. RPE1 cells were electroporated with plasmids. Plasmids 
encoding fusion proteins or shRNAs were transfected into MDA-MB-231 cells 
using liposomes. Lamellipodial dynamics and random migration were analysed
with ImageJ using the plug-ins ‘Kymograph’ and ‘MtrackJ’, respectively. Live-cell
imaging was performed using an inverted Axio Observer microscope (Zeiss)
equipped with two chambers controlled for temperature and CO2. For Arpin
localization, MEF cells were fixed with 10% TCA, permeabilized with 0.2% Triton 
X-100, and processed for immunofluorescence. To draw radial line scans, a custom- 
made ImageJ plug-in was developed, edge was determined using ‘Isodata’ and 
‘Analyze particle’, and then a custom-made VBA macro in Excel was used to align 
data relative to the edge. Keratocytes were isolated from scales of freshly killed 
brook trout (Salvelinus fontinalis) and microinjected with a Femtojet (Eppendorf). 
Plots were drawn with SigmaPlot (SPSS) and statistics were performed with 
SigmaStat (SPSS). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 
Plasmids. Human and zebrafish Arpin were amplified by PCR from clones 
IMAGE:5770387 and IMAGE:7404342, respectively (GeneService). Dictyostelium 
discoideum DdArpin was amplified from Ax2 cDNA. Human full-length Arpin 
(residues 1-226), ArpinΔA (residues 1-210) or ArpinA (residues 211-226), zebrafish
full-length Arpin (residues 1-226), ArpinΔA (residues 1-210), and murine
N-WASP VCA fragment (residues 392-501) were cloned into a modified pGEX
vector with a TEV cleavage site between the restriction sites FseI and AscI. For
expression in mammalian cells, Arpin inserts were cloned into a compatible plasmid,
pcDNAm PC-GFP blue. Zebrafish full-length Arpin was also inserted into
pBluescript to generate probes for in situ hybridization and into pCS2-GFP for
rescue experiments. Human RAC1 wild-type, Thr17Asn, Gln61Leu, the Arp2/3
complex subunits ArpC5A and ArpC5B were cloned into pcDNA5 His PC
TEV blue. For expression in amoeba, Dictyostelium Arpin was inserted into
pDGFP-MCS-neo. For shRNA-expressing plasmids, two hybridized oligonucleotides
(MWG) were cloned into psiRNA-h7SKblasti G1 (Invivogen) according
to the manufacturer’s protocol. The following target sequences were used: shArpin 
1: 5'-GGAGAACTGATCGATGTATCT-3’; shArpin 2: 5'-GCTTCCTCATGT 
CGTCCTACA-3’; shArpin 3: 5’-GCCTTCCTAGACATTACATGA-3’ (targets 
the 3’ untranslated region (UTR) of arpin mRNA); shArpC2 1: 5'-CATGTATGT 
TGAGTCTAA-3’; and shArpC2 2: 5'-GCTCTAAGGCCTATATTCA-3’. These 
plasmids were compared to the non-targeting control provided by Invivogen; 
shControl: 5'-GCATATGTGCGTACCTAGCAT-3’. All constructs were verified 
by sequencing. 
Protein purification. Arpin, ArpinΔA, ArpinA and N-WASP VCA fused to GST
were purified from E. coli BL21* strain (Life Technologies) using standard purification
protocols, dialysed against storage buffer (20 mM Tris-HCl, 50 mM NaCl,
1 mM dithiothreitol (DTT), pH 7.5), frozen in liquid nitrogen and stored at −80 °C.
When indicated, Arpin was cleaved by TEV protease off GST. Arpin bound to 
glutathione sepharose 4B beads was cleaved by overnight incubation at 4 °C using 
His-tagged TEV protease in 50 mM Tris, pH 7.5, 2 mM B-mercaptoethanol, 100 mM 
NaCl and 5mM MgCl. TEV was removed by incubation with Ni?* beads (GE 
Healthcare). Arpin was further purified by size-exclusion chromatography (SEC) 
on a Superdex-200 column (GE Healthcare) and concentrated on Vivaspin filters. 
Human Arpin was used for the production of polyclonal antibodies and competition 
experiments. Zebrafish Arpin was similarly produced, purified and used 
for keratocyte injection at 7.5 µg µl⁻¹ in 15 mM Tris-HCl, 150 mM NaCl, 5 mM 
MgCl₂ and 1 mM DTT, pH 7.5. Both proteins had an amino-terminal extension of 
10 amino acids (GAMAHMGRP) after TEV cleavage. ArpinA peptide (residues 
211-226 of full-length Arpin) was purchased from Proteogenix. For the SEC 
coupled to multiangle light scattering (SEC-MALS) characterization, proteins 
were separated in a 15-ml KW-803 column (Shodex) run on a Shimadzu HPLC 
system. MALS, quasi-elastic light scattering (QELS) and refractive index (RI) measurements 
were made with a MiniDawn Treos (Wyatt Technology), a WyattQELS 
(Wyatt Technology) and an Optilab T-rEX (Wyatt Technology), respectively. 
Molecular mass and hydrodynamic radius calculations were performed with the 
ASTRA VI software (Wyatt Technology) using a dn/dc value of 0.183 ml g⁻¹. 
Antibodies. Polyclonal antibodies targeting Arpin were obtained in rabbits (Agro- 
Bio) against purified human Arpin, and then purified by affinity purification on a 
HiTrap NHS-activated HP column (GE Healthcare) coupled to the immunogen. 
ArpC2 polyclonal antibody and cortactin monoclonal antibody (clone 4F11) were 
from Millipore. ArpC5 monoclonal antibody (clone 323H3) was from Synaptic 
Systems. Brk1 monoclonal antibody (clone 231H9) was described earlier*'. Tubulin 
monoclonal antibody (clone E7) was obtained from Developmental Studies 
Hybridoma Bank. PC monoclonal antibody (clone HPC4) was from Roche. 
In vitro assays of actin polymerization. Pyrene actin assays and monitoring of 
the branching reaction were performed as described previously. VCA refers to 
the VCA domain of WAVE1 purified as described. Conditions for Fig. 1c were: 
2 µM actin (10% pyrene-labelled), 500 nM VCA, 20 nM Arp2/3 and full-length 
Arpin or ArpinΔA at the indicated concentrations. Conditions for Fig. 1d were: 
1 µM actin (10% rhodamine-labelled), 150 nM VCA, 80 nM Arp2/3 and 5 µM 
Arpin when indicated. 
Fluorescence anisotropy based determination of Kd. The ArpinA peptide was 
synthesized and labelled with 5-TAMRA at the amino terminus (Proteogenix). 
The peptide was excited with polarized light at 549 nm and emitted light was 
detected at 573 nm using a MOS450 fluorimeter (Biologic). Measurements were 
made for 60 s at 1 point s⁻¹, and the average anisotropy was calculated with the 
Biologic software. Fits were performed as described previously. 
GST pull-down, immunoprecipitations, SDS-PAGE and western blots. HeLa 
cells were lysed in 50 mM Tris-HCl, 150 mM NaCl, 1 mM EDTA, 1 mM DTT, 
0.5% Triton X-100 and 5% glycerol, pH 7.5. GST fusion protein (20 µg) associated 
with 20 µl of glutathione sepharose 4B beads (GE Healthcare) was incubated with 
1 ml HeLa cell extracts for 2 h at 4 °C. Beads were washed and analysed by 
western blot. 

Co-immunoprecipitation of Arpin with the Arp2/3 complex was performed 
with either two 15-cm dishes of MEF cells, or one 10-cm dish of transfected 293T 
cells. Cell lysates prepared in 10 mM HEPES, pH 7.7, 50 mM KCl, 1 mM MgCl₂, 
1 mM EGTA and 1% Triton X-100 were incubated with 10 µg of non-immune 
rabbit IgG or 10 µg of anti-Arpin antibodies coupled to tosyl-activated dynabeads 
(Life Technologies) or to GFP-trap beads (Chromotek). Beads were incubated 
with extracts for 2 h at 4 °C, washed and analysed by western blot. 

SDS-PAGE was performed using NuPAGE 4-12% Bis-Tris gels (Life Technologies). 
For western blots, nitrocellulose membranes were developed using 
horseradish peroxidase (HRP)-coupled antibodies, Supersignal kit (Pierce) and 
a LAS-3000 imager (Fujifilm). 
Cells and transfections. hTERT immortalized RPE1 cells (Clontech) were grown 
in DMEM/HAM F12, MEFs and 293T cells in DMEM, and MDA-MB-231 cells 
were grown in RPMI. All media were supplemented with 10% FBS (media and 
serum from PAA Laboratories). All cells and stable clones were found negative for 
mycoplasma infection by a sensitive PCR assay. 

RPE1 cells were electroporated with ECM 630 BTX (Harvard Apparatus). Ten 
million cells were resuspended in 200 µl serum-free DMEM/HAM F12 medium 
containing 7.5 mM HEPES, pH 7.5, mixed with 10-40 µg DNA plasmid in 50 µl 
210 mM NaCl and electroporated at 1,500 µF and 250 V. To isolate stable 
Arpin-depleted clones, MDA-MB-231 cells were transfected with shRNA Arpin 3 or 
shControl using Lipofectamine 2000, and clones selected with 10 µg ml⁻¹ blasticidin 
(Invivogen) were isolated with cloning rings and expanded. For rescue 
experiments, cells transfected with shRNA 3, which targets the 3’ UTR, were 
transfected with GFP-Arpin, which lacks UTR sequences. To validate Arpin 
localization, MEFs were transfected with non-targeting (D-001810-10) or Arpin-targeting 
(J-059240-10; ON-TARGETplus siRNA, Dharmacon) siRNA using Lipofectamine RNAiMAX 
(Life Technologies), and examined after 2 days. 

Immunofluorescence and live imaging of mammalian cells. Cells were fixed in 
10% TCA, permeabilized with 0.2% Triton X-100, and processed for 
immunofluorescence. To draw radial line scans, a custom-made ImageJ plug-in was 
developed; the cell edge was determined using ‘Isodata’ thresholding, and a custom-made 
VBA macro in Excel was then used to align data relative to the edge. Lamellipodial 
dynamics and random migration were analysed with ImageJ using the plugins 
‘Kymograph’ and ‘MtrackJ’, respectively. All imaging was done on an Axio 
Observer microscope (Zeiss) equipped with a Plan-Apochromat 63×/1.40 oil 
immersion objective, an EC Plan-Neofluar 40×/1.30 oil immersion objective 
and a Plan-Apochromat 20×/0.80 air objective, a Hamamatsu camera C10600 
Orca-R2 and a Pecon Zeiss incubator XL multi S1 RED LS (Heating Unit XL S, 
Temp module, CO₂ module, Heating Insert PS and CO₂ cover). 

Fish keratocytes. Keratocytes were isolated from scales of freshly killed brook 
trout (Salvelinus fontinalis) as previously described** and imaged by phase contrast 
on an inverted Zeiss Axioscope using ×63 optics, and a halogen lamp as light 
source. Microinjection was performed with a micromanipulator (Leitz) and a 
micro-injector Femtojet (Eppendorf) controlling backpressure and injection pulses. 
Contours were analysed using the CellTrack software (Ohio State University). 
Zebrafish. Embryos were obtained by natural spawning of Tg(-1.8gsc:GFP)ml1 
fish**. In these embryos, prechordal plate cells can be identified by their expression 
of GFP. In situ hybridization was performed following standard protocols*’. For 
loss of function experiments, a morpholino directed against arpin (GTTGTCAT 
AAATACGACTCATCTITC, where the underlined anticodon corresponds to the 
initiating ATG codon), or a standard control morpholino (CCTCTTACCTCAG 
TTACAATTTATA) was injected at the one-cell stage, together with histone2B- 
mCherry mRNAs or Lifeact-mCherry mRNAs, and GFP—Arpin mRNAs for rescue 
experiments. To analyse cell trajectories, confocal z-stacks were acquired every min- 
ute using a Nikon confocal spinning disk with an Evolve camera (Photometrics). 
Nuclei were tracked using Imaris (Bitplane). Further analyses were performed 
using custom routines in Matlab (MathWorks)**. All animal studies were done 
in accordance with the guidelines issued by the French Ministry of Agriculture 
(Decree no 2013-118) and have been submitted to Paris ethical committee no. 3. 
Dictyostelium discoideum. Cultivation and transformation by electroporation of 
D. discoideum cells was performed as described”’. To knockout Arpin, a piece of 
genomic DNA containing the arpin coding sequence with its intron was amplified 
from Ax2 wild-type amoeba, using oligonucleotides DdArpin_BU 5'-CGCGGA 
TCCGCATGAGTTCAAGTACAAATTATAGT-3’ and DdArpin_SD 5'-CGCG 
TCGACTTTATTTCCATTCATCATCATCTTC-3’. The cloned PCR fragment 
was then used as a template to amplify a 5’ fragment (using 5’-CGCGGATCCG 
CATGAGTTCAAGTACAAATTATAGT-3’ and 5’-GCGCTGCAGCATCTGA 
AATTGCAACTGATAGTTG-3’) and a 3’ fragment (using 5‘-GCGAAGCTTTC 
TICTTTACCTTCAAATTTTCAT-3’ and 5'-CGCGTCGACGTTGGTTATTT 
GATTCTATTTGATC-3’). These two fragments were cloned so as to flank a cassette 
carrying blasticidin resistance in the pLPBLP vector. Arpin knockout clones were 
selected in HL5c medium supplemented with 10 µg ml⁻¹ blasticidin S (Invivogen) 
after electroporation of the resulting vector. Recombination was assessed using 
diagnostic PCRs that distinguish knockout from wild-type amoeba. GFP-Arpin 
re-expressing knockout lines were obtained after electroporation of pDGFP-Arpin 
and selection with 10 µg ml⁻¹ geneticin (Sigma). Two time series with more than 
30 cells each were acquired per amoeba. Two clones isolated after each 
transformation gave similar results. 

Statistics. Statistical analysis of the results was carried out with SigmaStat software 
(SPSS inc., v2.03). When data satisfied the two criteria of normality and equal 
variance, parametric tests were used: t-test to compare two groups; ANOVA for 
more than two. Where indicated, a bijective transformation was applied to the data 
to pass the two criteria of normality and equal variance. When data did not satisfy 
both criteria even after transformation, non-parametric tests were applied: Mann- 
Whitney to compare two groups; Kruskal-Wallis for more than two. A representative 
experiment is plotted and results are expressed as mean ± s.e.m. with respect 
to the number of cells (n). Differences were considered significant at confidence 
levels greater than 95% (two-tailed). Three levels of statistical significance are 
distinguished: *P < 0.05; **P < 0.01; ***P < 0.001. 
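
The decision rule in the Statistics paragraph can be sketched as a small lookup. This is only an illustration of the logic as described, not the SigmaStat procedure itself: the function names are hypothetical, and the normality and equal-variance checks are assumed to have been run elsewhere and passed in as booleans.

```python
def choose_test(n_groups, normal, equal_variance,
                normal_after_transform=False,
                equal_variance_after_transform=False):
    """Return the name of the test the Statistics paragraph would select.

    The boolean arguments are the outcomes of normality and equal-variance
    checks performed elsewhere (assumption: this sketch does not run them).
    """
    if n_groups < 2:
        raise ValueError("at least two groups are needed")
    # Parametric tests require both criteria, either on the raw data
    # or after a bijective transformation of the data.
    parametric = (normal and equal_variance) or (
        normal_after_transform and equal_variance_after_transform
    )
    if parametric:
        return "t-test" if n_groups == 2 else "ANOVA"
    return "Mann-Whitney" if n_groups == 2 else "Kruskal-Wallis"


def significance_stars(p):
    """Map a two-tailed P value to the three significance levels used here."""
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return ""
```

For example, three groups that fail the equal-variance criterion even after transformation would be routed to Kruskal-Wallis.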
Extended Data Figure 1 | Prediction of secondary structure elements and 
disordered regions of Arpin. A multiple alignment of the Arpin orthologues 
was performed with MUSCLE and displayed with Jalview. Two methods 
relying on multiple alignments of Arpin orthologues were used to predict 
secondary structures and disordered regions, Psipred and Disopred, 
respectively. The predicted secondary structure (SS) elements are indicated by 
green arrows for β-strands, red cylinders for α-helices, and a black line for coils; 
the associated confidence (conf) score is displayed below, ranging from 0 to 9 
for poor and high confidence, respectively. Amino acid conservation is 
indicated by a 0 to 10 score, and highlighted by brown to yellow histogram bars. 
The confidence in predicting disorder is also scored from 0 to 10, by 
multiplying tenfold the Disopred probability. Arpin is predicted to be a 
structured protein with the notable exception of the 20 C-terminal residues, 
which are predicted to be disordered with high confidence. 
Extended Data Figure 2 | NMR analysis of ¹⁵N-labelled human Arpin. Both 
views represent ¹H-¹⁵N heteronuclear single quantum coherence (HSQC) 
spectra. Each peak corresponds to the ¹H-¹⁵N backbone amide bond of a 
specific residue. The position of a ¹H-¹⁵N peak in the spectrum depends on the 
chemical environment of the corresponding residue. a, Such a scattered 
distribution of peaks is characteristic of a folded protein. The last 20 residues 
were assigned to individual peaks and are displayed on the spectrum. These 
residues are clustered in the centre of the spectrum. b, Same HSQC spectrum 
displayed with a higher threshold to display only high peaks. The height of a 
peak depends on the mobility of the residue on a picosecond to millisecond 
timescale. This spectrum experimentally demonstrates that the 20 C-terminal 
residues are highly mobile. This result confirms that, as predicted, the Arp2/3- 
binding site of Arpin is exposed as a poorly structured tail of the protein. 
Extended Data Figure 3 | Characterization of recombinant Arpin. a, Full- 
length Arpin or ArpinΔA from human or zebrafish cDNA was expressed in E. 
coli and purified. Purity was assessed by SDS-PAGE and Coomassie staining. 
These proteins were used for in vitro actin polymerization assays and for fish 
keratocyte injection, respectively. b, Analysis of the molar mass of full-length 
Arpins by size-exclusion chromatography coupled to multiangle light 
scattering (SEC-MALS). The ultraviolet measurement (left axis, dashed line) 
and the molar mass (right axis, horizontal solid line) were plotted as a function 
of column elution volume. c, SEC-MALS measures of masses indicate that both 
proteins are monomeric in solution. d, GST pull-down using a lysate of 293 cells 
overexpressing PC-tagged Arpin and purified GST-Rac1 wild type, 
GST-Rac1(Gln61Leu) or GST alone as a negative control. Arpin did not 
associate with either type of Rac. By contrast, the endogenous WAVE complex 
bound to Rac(Gln61Leu), but not Rac wild type, as expected from a Rac 
effector. e, Untagged Rac1 was purified from E. coli and then loaded with either 
GDP or GTPγS. Human Arpin (60 µM), Rac1 (120 µM) and a mixture of these 
two proteins were analysed by SEC-MALS as above. SEC was run in 20 mM 
HEPES, 100 mM NaCl, pH 7.4. The height of ultraviolet peaks was normalized 
to 1 to be displayed on the same figure. A single peak was detected in all cases. 
The measured masses indicate that no complex is formed between Arpin and 
Rac and that the single peak observed in the mixture corresponds to 
cofractionation of the two proteins of similar mass by SEC. 
Extended Data Figure 4 | Arpin directly binds to the Arp2/3 complex. 
a, Fluorescence anisotropy measurements of labelled ArpinA peptide binding 
at equilibrium to the purified Arp2/3 complex at the indicated concentrations. 
b, Labelled ArpinA peptide bound to the Arp2/3 complex was then titrated with 
purified full-length Arpin, ArpinΔA or unlabelled ArpinA peptide as indicated. 
Full-length Arpin displaces the labelled ArpinA peptide more efficiently than 
the A peptide. ArpinΔA is unable to displace the ArpinA peptide. Curves that 
best fit the values yield the indicated equilibrium constants. c, Arpin inhibits 
Arp2/3 activation in the pyrene-actin assay. Part of this experiment is displayed 
in Fig. 1c; more curves are plotted here. d, From curves in c, the number of actin 
barbed ends is calculated from the slope at half-polymerization using the 
relationship described previously. The best fit of the values indicates an apparent 
Kd value of 760 ± 156 nM for the Arp2/3 complex in a mixture including actin 
and the VCA. e, ArpinA inhibits Arp2/3 activation in a dose-dependent manner 
in the pyrene-actin assay. Conditions: 2 µM actin (10% pyrene-labelled), 
500 nM VCA, 20 nM Arp2/3 and ArpinA at the indicated concentrations. 
f, ArpinA competes with the NPF for Arp2/3 binding. Arp2/3 is displaced from 
its interaction with 5 µM GST N-WASP VCA immobilized on glutathione 
beads by the Arpin acidic peptide (304 UM and serial twofold dilutions). 
Extended Data Figure 5 | Specific localization of Arpin at the 
lamellipodium tip. a, Arpin overlaps with the Arp2/3 complex at the 
lamellipodium tip. b, Arpin overlaps with cortactin at the lamellipodium tip. 
Arpin staining is lost after short interfering RNA (siRNA)-mediated depletion. 
Intensity profiles along multiple line scans encompassing the cell periphery 
were registered to the outer edge of the staining of a lamellipodial marker. This 
marker was Arpin in a and cortactin in b. The multiple line scans were then 
averaged and displayed as an intensity plot, where the y axis represents 
fluorescent intensity, arbitrary units (mean ± s.e.m., n = 17, 16 and 17, 
respectively). Scale bar, 20 µm. Arp2/3 localization extends rearwards relative 
to Arpin localization. This is because the Arp2/3 complex becomes a 
branched junction when activated by the WAVE complex at the lamellipodium 
tip. The branched junction undergoes retrograde flow like actin itself due to 
actin filament elongation. Cortactin recognizes Arp2/3 at the branch 
junction and is thought to stabilize branched actin networks. As a marker of 
the branched junction, cortactin stains the width of lamellipodia, like the 
Arp2/3 complex. c, Rac1 knockout MEFs that lack lamellipodia are 
completely devoid of Arpin staining at the cell periphery, in line with the 
complete absence of lamellipodia indicated here by the absence of cortactin 
staining. Arpin is normally expressed in the Rac1 knockout MEFs (see Fig. 2c). 
Intensity profiles along multiple line scans encompassing the cell periphery 
were averaged after manual drawing of the cell edge (mean ± s.e.m., n = 16). 
Scale bar, 20 µm. 
Extended Data Figure 6 | Arpin regulates cell spreading through its 
interaction with the Arp2/3 complex. Arpin was depleted from human RPE1 
cells after transient transfection of shRNA plasmids and blasticidin-mediated 
selection of transfected cells. After 5 days, cells were either analysed by western 
blot or used for the spreading assay. Cells were serum-starved for 90 min in 
suspension in polyHEMA-coated dishes and then allowed to spread on 
collagen-I-coated coverslips for 2 h. Phalloidin staining was used to calculate 
cell surface area of individual cells using ImageJ. Mean ± s.e.m.; **P < 0.01, 
***P < 0.001; t-test or ANOVA when more than two conditions. a, Arpin 
depletion increases cell spreading (n = 57 and 52, respectively). The same effect 
is obtained with three shRNAs targeting Arpin (n = 51, 48 and 59, 
respectively). b, This effect is rescued by GFP-Arpin expression in knockdown 
cells, but not by GFP-ArpinΔA expression (n = 63, 56, 63 and 52, respectively). 
c, Combined depletion of Arpin and the Arp2/3 complex reverses the 
phenotype of Arpin depletion. The effect is seen with two shRNAs targeting 
ArpC2 (n = 56, 60, 69, 68, 63 and 66, respectively). The last two experiments 
indicate that Arpin exerts its effect on cell spreading through its ability to 
regulate the Arp2/3 complex. 
Extended Data Figure 7 | Arpin regulates protrusion frequency of 
prechordal plate cells and their collective migration in zebrafish embryos. 
a, In situ hybridization of arpin probe in zebrafish embryos at different stages. 
arpin mRNAs are maternally deposited. During gastrulation, arpin is expressed 
in hypoblast, which includes the prechordal plate. b, Three-dimensional 
trajectories of prechordal plate cells in embryos injected with control or arpin 
morpholino. During fish gastrulation, prechordal plate cells migrate 
collectively in a straight direction from the margin of the embryo towards the 
animal pole. Loss of arpin function induces dispersion as evidenced by 
increased lateral cell displacement (n = 1,516 and 1,546) and a higher distance 
between cells (n = 194 and 235). Lateral displacement is the cell movement 
perpendicular to the main direction of migration. Distance between cells refers 
to the average distance of the nucleus of a given cell to the nuclei of its five 
closest neighbours. Mean ± s.e.m.; ***P < 0.001, t-test. c, At the onset of 
gastrulation, prechordal plate cells derived from morpholino-injected embryos 
were transplanted into the prechordal plate of an untreated host embryo at 
the same stage in order to allow imaging of cell-autonomous effects on 
protrusion formation. Donor embryos are injected with control or arpin 
morpholinos and mRNAs encoding Lifeact-mCherry as well as GFP-Arpin for 
the rescue. Time-lapse imaging of injected cells is performed by epifluorescence 
to reveal Lifeact, a marker of filamentous actin, which stains actin-based 
protrusions. For each cell, presence of a protrusion was assessed at each frame 
to deduce probability of protrusion presence and protrusion lifetimes. arpin 
loss of function increases the probability of presence of protrusions 
(n = 8, 8 and 10, respectively; *P < 0.05, ANOVA) and their duration 
(in this case, n corresponds to the number of protrusions; n = 42, 41 and 40, 
respectively; *P < 0.05, Kruskal-Wallis). Protrusions are indicated by 
arrowheads. Scale bar, 50 µm. 
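
The "distance between cells" measure defined in panel b (mean distance from one nucleus to its five closest neighbours) can be sketched as follows. This is a minimal brute-force illustration, assuming nuclei are given as coordinate tuples in a common unit; the function name is hypothetical and this is not the authors' Matlab routine.

```python
import math


def mean_neighbour_distance(nuclei, k=5):
    """For each nucleus, return the mean distance to its k closest neighbours.

    nuclei: list of (x, y, z) coordinate tuples (hypothetical input format).
    """
    if len(nuclei) <= k:
        raise ValueError("need more than k nuclei")
    result = []
    for i, p in enumerate(nuclei):
        # Distances from nucleus i to every other nucleus, smallest first.
        distances = sorted(
            math.dist(p, q) for j, q in enumerate(nuclei) if j != i
        )
        result.append(sum(distances[:k]) / k)
    return result
```

The brute-force search is quadratic in the number of nuclei, which is acceptable for a few hundred cells per time point; a spatial index would be the usual choice for larger data sets.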
Extended Data Figure 8 | Arpin depletion increases cell migration in three 
dimensions. Stable MDA-MB-231 clones depleted of Arpin or not were 
embedded in a collagen gel. a, Single-cell trajectories illustrate that control cells 
hardly move in this dense environment (see Supplementary Video 4), unlike 
Arpin-depleted cells, which explore a significant territory, albeit at lower pace 
than in two dimensions, as evidenced by mean square displacement (Extended 
Data Fig. 9). b, Cell speed is significantly increased in the Arpin-depleted 
clones. Mean ± s.e.m.; n = 27, 25, 26 and 17, respectively, *P < 0.05, 
Kruskal-Wallis, two experiments. Directional persistence, calculated by d/D, is 
not significantly different in the clones depleted of Arpin or not. Direction 
autocorrelation (Extended Data Fig. 10), however, shows an increased 
directionality in the Arpin-depleted cells at the earliest time points. 
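
The legend gives directional persistence only as d/D. A minimal sketch follows, assuming the common reading in which d is the straight-line distance between the first and last positions and D is the total path length; that reading and the function name are assumptions, not stated in the legend.

```python
import math


def directional_persistence(track):
    """Return d/D for one trajectory.

    track: list of (x, y) or (x, y, z) positions in time order.
    d is the net start-to-end distance; D is the summed step length.
    """
    if len(track) < 2:
        raise ValueError("a track needs at least two positions")
    path_length = sum(math.dist(p, q) for p, q in zip(track, track[1:]))
    if path_length == 0:
        return 0.0  # the cell never moved; persistence is undefined, report 0
    return math.dist(track[0], track[-1]) / path_length
```

The ratio is 1 for a perfectly straight path and approaches 0 for a path that returns to its origin. Because it depends on track duration, it is less sensitive than the direction autocorrelation to differences at early time points.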
Extended Data Figure 9 | Analysis of mean square displacement of the 
different migration experiments. The mean square displacement gives a 
measure of the area explored by cells for any given time interval. By setting a 
positional vector on the cellular trajectory at time t, the MSD is defined as: 

$$\mathrm{MSD}(\Delta t) = \left\langle [x(t_0 + \Delta t) - x(t_0)]^2 + [y(t_0 + \Delta t) - y(t_0)]^2 \right\rangle_{t_0, N}$$

in which brackets ⟨ ⟩ indicate averages over all starting times t₀ and all cells N. 
For each time interval Δt, mean and s.e.m. are plotted. Error bars corresponding 
to s.e.m. are plotted, even if too small to be visible. The grey area excludes the 
noisy part of curves corresponding to large time intervals where fewer data 
points are available. 
a, MDA-MB-231 depleted or not of Arpin in a two- or three-dimensional 
environment. Arpin-depleted MDA-MB-231 cells explore a larger territory 
than the controls in time intervals examined (for two dimensions, n as indicated 
in Fig. 3a; P < 0.001, two-way ANOVA with time and conditions; for three 
dimensions, n as indicated in Extended Data Fig. 8; P < 0.001, two-way 
ANOVA with time and conditions). b, Dictyostelium discoideum knockout 
amoebae explore a larger territory than controls and rescued amoebae (n as 
indicated in Fig. 3b; P < 0.001, two-way ANOVA with time and conditions). 
c, Arpin-injected fish keratocytes explore a smaller territory than the controls 
(n as indicated in Fig. 4b; P < 0.001, two-way ANOVA with time and 
conditions). 
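
The mean square displacement defined in this legend can be computed as sketched below. The sketch assumes two-dimensional tracks sampled at a constant frame interval, with the time interval expressed in frames; the function name is hypothetical. The s.e.m. here treats every overlapping interval as an independent observation, which is a simplification and may differ from how the plotted error bars were derived.

```python
import math
import statistics


def mean_square_displacement(tracks, lag):
    """Return (mean, s.e.m.) of squared displacements for one time interval.

    tracks: list of trajectories, each a list of (x, y) positions
            sampled at a constant frame interval.
    lag:    time interval in frames (positive integer).
    The average runs over all starting times and all cells.
    """
    if lag < 1:
        raise ValueError("lag must be a positive number of frames")
    squared = []
    for track in tracks:
        for i in range(len(track) - lag):
            (x0, y0), (x1, y1) = track[i], track[i + lag]
            squared.append((x1 - x0) ** 2 + (y1 - y0) ** 2)
    if not squared:
        raise ValueError("no track is long enough for this lag")
    mean = sum(squared) / len(squared)
    sem = (
        statistics.stdev(squared) / math.sqrt(len(squared))
        if len(squared) > 1
        else 0.0
    )
    return mean, sem
```

Large intervals leave few start times per track, which is why the curves become noisy at the right-hand end and why that region is greyed out in the figure.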
Extended Data Figure 10 | Analysis of direction autocorrelation of the 
different migration experiments. a, Principle of the analysis. A hypothetical 
cell trajectory is depicted. Each step is represented by a vector of 
normalized length. θ is the angle between compared vectors. The plot illustrates 
the cos θ values for the putative trajectory of four steps (colour-coded). 
Averaging these cos θ values yields the direction autocorrelation (DA) 
function of time that measures the extent to which these vectors are 
aligned over different time intervals. The DA function is defined as: 

$$\mathrm{DA}(t) = \left\langle \mathbf{v}(t_0) \cdot \mathbf{v}(t_0 + t) \right\rangle_{t_0, N} = \left\langle \cos\theta(t_0, t_0 + t) \right\rangle_{t_0, N}$$

in which v(t₀) is the vector at the starting time t₀, and v(t₀ + t) the vector at 
t₀ + t. Brackets indicate that all calculated cosines are averaged for all possible 
starting times (t₀) over all cells (N). For each time interval t, vectors from all 
cell trajectories were used to compute average and s.e.m. Error bars 
corresponding to s.e.m. are plotted, even if too small to be visible. 
b, Arpin-depleted MDA-MB-231 clones 
turn less than control cells (for two dimensions, n as indicated in Fig. 3a; 
P < 0.05 between 10 and 40 min, Kruskal-Wallis; for three dimensions, n as 
indicated in Extended Data Fig. 8; P < 0.05 at time 10 min, Kruskal-Wallis). 
c, Arpin knockout amoebae turn less than wild-type amoebae, and GFP-Arpin 
overexpressing knockout amoebae (rescue) turn more than wild type 
(n as indicated in Fig. 3b; P < 0.05 between 5 and 85 s, Kruskal-Wallis). 
d, Arpin-injected fish keratocytes turn more than buffer-injected cells, and 
ArpinΔA-injected keratocytes turn more than buffer-injected but less than 
full-length-Arpin-injected keratocytes (n as indicated in Fig. 4b; P < 0.05 
between 16 and 272 s, Kruskal-Wallis). 
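
The direction autocorrelation described in panel a can be sketched as below. The sketch assumes two-dimensional tracks sampled at a constant frame interval and expresses the time interval in frames; the function names are hypothetical. Steps of zero length have no direction and are skipped, which is a choice made for this sketch and not stated in the legend.

```python
import math


def unit_steps(track):
    """Turn a list of (x, y) positions into unit step vectors.

    A step of zero length has no direction and is returned as None.
    """
    steps = []
    for (x0, y0), (x1, y1) in zip(track, track[1:]):
        dx, dy = x1 - x0, y1 - y0
        norm = math.hypot(dx, dy)
        steps.append((dx / norm, dy / norm) if norm > 0 else None)
    return steps


def direction_autocorrelation(tracks, lag):
    """Average cos(theta) between step vectors separated by `lag` frames.

    The average runs over all starting times and all tracks.
    """
    if lag < 1:
        raise ValueError("lag must be a positive number of frames")
    cosines = []
    for track in tracks:
        steps = unit_steps(track)
        for i in range(len(steps) - lag):
            a, b = steps[i], steps[i + lag]
            if a is None or b is None:
                continue
            # Dot product of unit vectors equals the cosine of their angle.
            cosines.append(a[0] * b[0] + a[1] * b[1])
    if not cosines:
        raise ValueError("no pair of steps available for this lag")
    return sum(cosines) / len(cosines)
```

A value near 1 means the cells keep their heading over that interval, a value near 0 means the heading is uncorrelated, and negative values indicate reversals. Because each step is normalized, the measure reflects turning only and is not affected by speed.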
Directional tissue migration through a 
self-generated chemokine gradient 
Lionel R. Newton¹, Ana Fernandez-Minan¹†, Wolfgang Huber¹, Michael Knop² & Darren Gilmour¹ 


The directed migration of cell collectives is a driving force of 
embryogenesis. The predominant view in the field is that cells in 
embryos navigate along pre-patterned chemoattractant gradients. 
One hypothetical way to free migrating collectives from the requirement 
of long-range gradients would be through the self-generation 
of local gradients that travel with them, a strategy that potentially 
allows self-determined directionality. However, a lack of tools for 
the visualization of endogenous guidance cues has prevented the 
demonstration of such self-generated gradients in vivo. Here we 
define the in vivo dynamics of one key guidance molecule, the che- 
mokine Cxcl12a, by applying a fluorescent timer approach to mea- 
sure ligand-triggered receptor turnover in living animals. Using the 
zebrafish lateral line primordium as a model, we show that migrat- 
ing cell collectives can self-generate gradients of chemokine activity 
across their length via polarized receptor-mediated internalization. 
Finally, by engineering an external source of the atypical receptor 
Cxcr7 that moves with the primordium, we show that a self-generated 
gradient mechanism is sufficient to direct robust collective migra- 
tion. This study thus provides, to our knowledge, the first in vivo proof 
for self-directed tissue migration through local shaping of an extra- 
cellular cue and provides a framework for investigating self-directed 
migration in many other contexts including cancer invasion. 

The zebrafish lateral line primordium is a migrating collective of 
approximately 100 cells that deposits a series of mechanosensory organs 
along the main body axis and whose route is defined by a stripe of the 
chemokine Cxcl12a (also known as Sdf1a). Previous genetic studies 
have shown that the primordium can follow the Cxcl12a stripe efficiently 
in both directions, excluding an instructive role for a pre-patterned 
chemokine gradient. Two ‘self-generated gradient’ hypotheses, whereby 
tissues sculpt directional gradients from uniform extrinsic guidance 
cues, have been proposed to explain this remarkable behaviour. First, 
Cxcl12a internalization by the guidance receptor Cxcr4b could generate 
a dynamic chemokine gradient at the tissue’s leading edge, which in 
mathematical models is sufficient to drive forward migration. Second, 
directional symmetry could be broken by internalization of Cxcl12a at 
the tissue’s trailing edge by Cxcr7 (refs 4, 11), a receptor with known 
Cxcl12a sinking and signalling functions. However, a lack of tools 
for the visualization of endogenous guidance signal activity has pre- 
vented the validation of self-generated gradient models in any in vivo 
context. 

We therefore established a novel readout of chemokine activity by 
measuring receptor turnover at the cell surface, a well-characterized 
cellular response to ligand binding, using a tandem fluorescent protein 
timer (tFT) approach. To this aim, we tagged the zebrafish Cxcr4b 
receptor with a tFT consisting of the superfolder green fluorescent 
protein (sfGFP) coupled to the slower maturing red fluorescent protein 
TagRFP. The red/green fluorescence ratio at the plasma membrane, 
hereafter referred to as the ‘lifetime ratio’, provides a direct indicator of 
the age of protein populations, whereby newly synthesized proteins 
show only green fluorescence, with red fluorescence increasing progressively 
with protein age. As the rate of ligand-triggered internalization 
determines how long receptors reside at the cell surface, receptor 
lifetime ratios would be predicted to decrease as chemokine levels 
increase (Fig. 1a). We first tested the response of the lifetime reporter 
in early-stage zebrafish embryos injected with defined concentrations 
of cxcl12a mRNA. This revealed that Cxcr4b-tFT lifetime ratios at the 
plasma membrane progressively decreased over a range of cxcl12a 
levels (Fig. 1b, c). Transgenic expression of the Cxcr4b-tFT reporter 
from a bacterial artificial chromosome (BAC) containing Cxcr4b cis- 
regulatory sequences (cxcr4b:cxcr4b-tFT) recapitulated Cxcr4b expression 
pattern and completely rescued the lateral line migration defect in 
Cxcr4b-deficient embryos (Extended Data Fig. 1), confirming that 
Cxcr4b-tFT was fully functional. Whole-mount embryo imaging 
revealed that Cxcr4b-tFT lifetime ratios varied in different embryonic 
tissues (Fig. 1d). A heat-shock-induced pulse of ubiquitous Cxcl12a 
(hsp70:cxcl12a) led to strongly enhanced receptor internalization and 
reduced lifetime ratios (Fig. 1e, f and Supplementary Video 1), with 
both the Cxcr4b-tFT reporter and endogenous receptor being sent for 
lysosomal degradation (Extended Data Fig. 2). Together, these experi- 
ments demonstrate that the Cxcr4b-tFT provides a dynamic readout 
of Cxcl12a chemokine activity in living embryos. 
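
The predicted inverse relationship between chemokine level and lifetime ratio can be illustrated with a toy steady-state model. This is a sketch under stated assumptions and not the authors' analysis: each fluorophore is assumed to mature by a single first-order step, receptors are assumed to leave the plasma membrane at a single first-order rate (taken here as a stand-in for ligand-triggered internalization), and the maturation half-times of 6 min (green) and 100 min (red) are hypothetical placeholders, not measured values for sfGFP or TagRFP.

```python
import math

# Hypothetical maturation rates (per minute), chosen only so that green
# matures much faster than red; they are not measured values.
GREEN_RATE = math.log(2) / 6.0
RED_RATE = math.log(2) / 100.0


def mature_fraction(maturation_rate, removal_rate):
    """Steady-state fraction of mature fluorophore in a surface pool.

    With first-order removal at rate k, receptor ages are exponentially
    distributed, so the mean of 1 - exp(-m * age) equals m / (m + k).
    """
    if maturation_rate <= 0 or removal_rate <= 0:
        raise ValueError("rates must be positive")
    return maturation_rate / (maturation_rate + removal_rate)


def lifetime_ratio(removal_rate, green_rate=GREEN_RATE, red_rate=RED_RATE):
    """Red/green ratio of the surface pool for a given removal rate."""
    return (mature_fraction(red_rate, removal_rate)
            / mature_fraction(green_rate, removal_rate))
```

In this toy model the ratio falls monotonically as the removal rate rises, as long as red matures more slowly than green, which matches the qualitative prediction that faster internalization gives lower lifetime ratios. Absolute values depend entirely on the placeholder rates.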

The Cxcr4b-tFT reporter was then used to study the pattern of che- 
mokine activity across the lateral line primordium. Spatially resolved 
lifetime profiles along the tissue front-rear axis revealed a clear gradient 
in Cxcr4b activity, with twofold higher Cxcr4b-tFT lifetime ratios in 
follower cells than cells of the leading domain (Fig. 2a, c and Extended 
Data Fig. 3). Consistently, anti-Cxcr4b immunostainings confirmed 
that endogenous receptor levels are reduced at the plasma membrane 
in leading cells (Extended Data Fig. 4). To rule out the possibility that 
the observed gradient in receptor age results from a higher transcrip- 
tion in the front, we measured lifetime ratios of a membrane-tethered 
version of the tFT reporter (mem-tFT) that does not respond to the 
chemokine. A gradient in lifetime ratio was non-detectable when mem- 
tFT was expressed under identical genetic control (Fig. 2b, d and Extended 
Data Fig. 5). We conclude that the Cxcr4b-tFT lifetime ratio gradient 
across the primordium results from graded chemokine-triggered 
receptor turnover. Furthermore, imaging of Cxcr4b-tFT lifetime ratios 
across the primordium during recovery from a pulse of uniform Cxcl12a 
chemokine expression revealed that the gradient in receptor turnover 
was re-established as the effect of the chemokine pulse subsided 
(Fig. 2e, f and Supplementary Video 2). These data are consistent with
the proposal that the primordium can self-generate a front-rear che- 
mokine activity gradient that it dynamically maintains through inter- 
action with its environment. 

The establishment of a method to visualize the chemokine activity 
across the migrating tissue allowed us to determine the role of the endo- 
genous chemokine receptors in shaping this gradient (Fig. 3a). Genetic 
inactivation of cxcr4b (mutant allele cxcr4b^t26035 (ref. 18)) decreased


EMBL Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany. Zentrum für Molekulare Biologie der Universität Heidelberg, Deutsches Krebsforschungszentrum, DKFZ-ZMBH Allianz, Im Neuenheimer Feld 282, 69120 Heidelberg, Germany. Present addresses: CABD, Universidad Pablo de Olavide, Carretera de Utrera km1, 41013 Seville, Spain (A.F.-M.); MRC National Institute of Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK (G.V.).


14 NOVEMBER 2013 | VOL 503 | NATURE | 285 


©2013 Macmillan Publishers Limited. All rights reserved 




[Figure 1 graphic. Recoverable labels: Cxcr4b lifetime reporter; steady state; cxcr4b-tFT mRNA; cxcl12a mRNA (ng µl⁻¹): 0, 10, 25, 50; posterior lateral line primordium; cxcr4b:cxcr4b-tFT (hsp70:cxcl12a); time: before heat-shock, +30 min, +330 min.]

Cxcr4b-tFT lifetime ratios (Fig. 3c–e), indicating that the endogenous
receptor reduces Cxcl12a levels available to the Cxcr4b-tFT reporter.
Lifetime ratios were more strongly reduced in cxcr7 mutants (cxcr7^sa16
(ref. 19)), despite its expression being restricted to cells at the tissue rear
(Fig. 3a–c). Conversely, knocking down cxcl12a in cxcr7 mutants increased
Figure 1 | Tandem fluorescent protein timers (tFTs) allow quantification of
chemokine-dependent receptor turnover in vivo. a, Schema describing
experimental rationale. Red/green intensity ratio increases as a function of
protein age (left). Chemokine-dependent receptor-internalization rate
determines ratio at plasma membrane (right). b, Top, schematic outline (left)
and images of 7 h post-fertilization (h.p.f.) embryos injected with cxcr4b-tFT
(middle) or cxcr4b-tFT plus cxcl12a mRNA (right), scale bar, 20 µm. Bottom,
ratiometric images of squared areas (calibration bar = normalized red/green
ratio). c, Box-and-whiskers plot of Cxcr4b-tFT lifetime ratios as a function of
cxcl12a mRNA concentration (one-way analysis of variance (ANOVA) test with
Dunnett’s multiple comparison procedure; control group: 0 ng µl⁻¹ cxcl12a
mRNA; **P < 0.01). d, BAC cxcr4b:cxcr4b-tFT embryo at 32 h.p.f. Inset,
magnification of dashed area (scale bars: embryo, 200 µm; inset, 50 µm).
e, Cxcr4b-tFT response to heat-shock-induced expression of Cxcl12a
(hsp70:cxcl12a, Supplementary Video 1). Scale bars: primordium, 20 µm;
insets, 5 µm. Insets, magnification of boxed areas (brightness increased in
bottom inset for visualization). f, Tissue-scale
lifetime ratios to levels similar to cxcl12a null mutants (cxcl12a^t30516
(ref. 20), Fig. 3c), confirming that the effect of Cxcr7 on Cxcr4b turnover
is through regulating chemokine availability. Spatially resolved
profiles revealed a pronounced reduction and flattening of Cxcr4b- 
tFT lifetime ratios in the rear two-thirds of cxcr7 mutant primordia 
(Fig. 3b, e). Interestingly, Cxcr4b-tFT lifetime ratio was also decreased 
in the leading domain that does not express Cxcr7, consistent with the 
idea that Cxcr7 regulates Cxcl12a availability across the entire prim- 
ordium (Fig. 3e, f). Direct testing of the Cxcr7 sink model came from 
expressing a fluorescently tagged version of Cxcl12a (Cxcl12a-GFP) in
cells adjacent to the migrating primordium. Cxcl12a-GFP accumulated
in cells at the rear of the tissue, in Cxcr7-positive endosomes
(Fig. 3g, h, Supplementary Video 3 and Extended Data Fig. 6). Thus,
although the extracellular chemokine gradient generated by the primordium
cannot be directly visualized using currently available tools, this
pattern of increased accumulation of Cxcl12a-GFP at the tissue rear
probably represents a ‘complementary image’ of its extracellular dis- 
tribution. We therefore conclude that Cxcr7 sink function at the tissue 
rear patterns chemokine activity across the migrating primordium. 
We next determined whether the activity of these receptors as sinks 
for Cxcl12a is of functional importance during collective migration. To 
this aim, we compared the ability of wild-type and internalization- 
defective versions of Cxcr4b (Cxcr4b* (ref. 16)) and Cxcr7 (Cxcr7*, 
Extended Data Fig. 7) to alter primordium migration. We expressed 
each receptor form using a primordium-specific Gal4 line that drives 


Figure 2 | Ligand-regulated chemokine receptor lifetime gradient indicates
self-generated migration. a, b, Primordia expressing cxcr4b:cxcr4b-tFT (a) or a
membrane-tethered version that does not respond to Cxcl12a (cxcr4b:mem-tFT)
(b). c, d, Spatially resolved lifetime ratio profiles of Cxcr4b-tFT-expressing
(c) and mem-tFT-expressing (d) primordia at 32 h.p.f. Inset, tissue-scale lifetime
ratios (box-and-whiskers plot, Welch’s t-test, ***P < 0.001). e, Cxcr4b-tFT
ratiometric images after pulse of Cxcl12a (hsp70:cxcl12a, Supplementary
Video 2). f, Quantification of images shown in e. Scale bars, 10 µm.
Figure 3 | Role of Cxcr4b and Cxcr7 sink activities in chemokine gradient
formation. a, Expression pattern of Cxcr7 (anti-Cxcr7 antibody (Ab), green in
bottom panel) and Cxcr4b (Cxcr4b-GFP, red in bottom panel). Cxcr7 vesicles
cluster in apically constricted cells of organ progenitors (asterisks, insets).
b, Ratiometric images of Cxcr4b-tFT primordia in wild type (WT, top),
cxcl12a−/− (middle) and cxcr7−/− (bottom). Calibration bar shows the
normalized red/green ratio. c, Box-and-whiskers plot of tissue-scale
Cxcr4b-tFT lifetime ratios at 32 h.p.f. (Welch’s t-test; asterisks refer to
comparison with WT if not specified otherwise; ***P < 0.001). cxcl12a Mo,
cxcl12a morpholino. d, Schematic interpretation of data in c. e, Spatially
resolved Cxcr4b-tFT
high-level expression in a mosaic pattern across the collective*’. Overexpression
of wild-type Cxcr4b-GFP had no influence on primordium
migration and was able to rescue cxcr4b mutant primordia (Fig. 3j).
Overexpression of Cxcr4b*-GFP gave similar results in both assays 
(Fig. 3i, j), demonstrating that Cxcr4b sink activity is dispensable for 
directional collective migration. By contrast, overexpression of wild- 
type Cxcr7-GFP markedly impaired migration (Fig. 3i, j) and caused 
elongation of the tissue (Extended Data Fig. 8), as expected from 
expanding the expression of a Cxcl12a sink within the primordium. 
By comparison, overexpression of Cxcr7*-GFP showed strongly reduced 
ability to interfere with migration (Fig. 3j), indicating that internaliza- 
tion is necessary for Cxcr7 activity in this context. 

To test whether Cxcr7 sink activity is sufficient for migration, as 
predicted by the self-generated gradient model, we engineered an 
external source of Cxcr7 that followed the collective, achieved by expres- 
sing the receptor in the posterior lateral line nerve (pLLN) that extends 
under the primordium en route without influencing its movement” 
(Fig. 4a and Supplementary Video 4). Nerve expression of Cxcr7-GFP,
achieved using a neural-specific promoter” (nerve:cxcr7-GFP) proved 
sufficient to interfere with the migration and morphology of wild-type 
primordia, with speed being reduced and the primordium becoming 


way ANOVA test with Bonferroni’s multiple comparison adjustment,
*P < 0.05, **P < 0.01, ***P < 0.001). Scale bars, 10 µm (20 µm in g).


elongated (Fig. 4b, c and Supplementary Video 5). Interestingly, migration
decelerated when the Cxcr7-expressing nerve advanced into the
leading domain (Fig. 4c), indicating that the position of the Cxcr7 sink 
dynamically influences tissue behaviour. Moreover, nerve:cxcr7-GFP 
expression reinstalled directional migration in cxcr7 mutant primordia 
with remarkable efficiency (Fig. 4d, e). Time-lapse imaging of these 
nerve-guided tissues again highlighted the importance of Cxcr7 sink 
positioning, with directional migration arresting when the nerve advanced 
into the leading domain and recommencing when it retracted to trail- 
ing regions (Fig. 4f, g and Supplementary Videos 6-8). Cxcr4b-tFT 
lifetime ratio analysis revealed a chemokine activity gradient across 
primordia in nerve:cxcr7-rescued embryos when compared to cxcr7 
mutant siblings, confirming that extrinsic Cxcr7 sink activity is suf- 
ficient to promote both directional migration and self-generated gra- 
dient formation (Fig. 4h-j). 

The data presented here provide compelling experimental support 
for the idea that migrating collectives can determine their own direc- 
tionality via a self-generated gradient mechanism. This has a number 
of implications for our understanding of cell movement in complex 
systems. Most obviously, autonomous dynamic remodelling of extra- 
cellular guidance cues by migrating tissues circumvents the necessity 
Figure 4 | Chemokine gradient formation by

for pre-existing long-range gradients, which may have spatial limitations.
Moreover, it could coordinate the migration of multiple collectives
within organisms, where gradients generated by one moving
tissue accelerate or retard the migration of another. Our finding that 
the relocation of Cxcr7 allows the trailing nerve to direct the tissue that 
normally guides it shows that such higher-level coordination is possible. 
These experiments further show that signalling functions assigned to 
Cxcr7, such as β-arrestin recruitment’, regulation of Cxcr4 expression*
or heterodimerization with Cxcr4 (ref. 24), are dispensable for this 
process, thus providing further support for signalling-independent 
activities for Cxcr7 in vivo'* and encouraging us to re-evaluate its role 
in lateral line migration’? (Supplementary Discussion and Extended 
Data Fig. 9). Here we have used protein lifetime reporters to reveal an 
autonomously generated chemokine activity gradient across the axis of 
a migrating collective. Although a number of cellular mechanisms could 
influence receptor turnover rates across tissues”, we demonstrate that 
Cxcr7 sink activity is required for and sufficient to pattern chemokine 
activity across the primordium. In the future it will be interesting to 
study how the tissue-scale chemokine gradient is converted to directed 
collective migration. For example, whether the self-generated gradient 
acts at the single-cell level, by directing the migration of individual cells 
within the leading domain, remains an open question. An attractive 
alternative, albeit non-mutually exclusive, proposal is that the function 
of Cxcr7 is to polarize motility at the tissue-scale by reducing chemo- 
kine activity levels to below a threshold, thus allowing follower cells 
behind the leading domain to adopt a different migratory behaviour. 
Finally, the generic nature of the lifetime imaging approach described 
here should facilitate the analysis of guidance signalling and self- 
generated gradient formation in other in vivo contexts. Given that 
Cxcr4 (ref. 26) and Cxcr7 (refs 27, 28) have important roles in cancer, 
it is tempting to speculate that a mechanism of this kind could promote
the self-directed migration of tumour cells*. 


METHODS SUMMARY 

Generation of Cxcr4b-tFT lifetime reporter BACs. Cxcr4b was tagged at the 
carboxy terminus with a TagRFP-sfGFP tFT through modification of the BAC 
clone CH211-145M5, containing 250 kilobases of genomic sequence, using standard
ET recombineering technology” (Gene Bridges).

Imaging of tFT reporter lines. Embryos were mounted in 1% low melting point 
(LMP) agarose in glass-bottom dishes (MatTek Corporation) and imaged on a Zeiss 
LSM 780 confocal microscope. High-resolution z-stacks of primordia were captured
using a 63× water immersion objective (C-APOCHROMAT, 1.2 numerical
aperture, Zeiss). Green and red signals were acquired sequentially using Gallium- 
Arsenide-Phosphide (GaAsP) detectors. A reference solution of purified mCherry- 
sfGFP fusion protein’” was used to normalize for intensity fluctuations affecting 
the ratio. Low-resolution z-stacks of the entire embryo were acquired with a 20×
objective (EC Plan-NEOFLUAR, 0.5 numerical aperture, Zeiss). Red and green
images correspond to merged Gaussian filtered and flattened (sum projection) 
dual-colour z-stacks, whereas ratiometric images were generated from sum pro- 
jections of segmented z-stacks. Colour-coding, if not specified otherwise, indicates 
red/green ratio after normalization by the reference solution. 

Image analysis of lifetime ratios. Custom software written in R was used for auto- 
mated data analysis, using many features of the open source R/Bioconductor image 
analysis package EBImage’’. The complete image analysis code is available as part 
of the Bioconductor data package DonaPLLP2013 (http://www.bioconductor.org). 
Statistical analysis. Statistical analysis of tissue-scale lifetime ratios was performed 
in R (Extended Data Fig. 10). The data set and vignette required to reproduce this 
analysis with runnable R code are available as part of the Bioconductor data package 
DonaPLLP2013 (http://www.bioconductor.org). GraphPad Prism 4.0 (GraphPad 
Software) was used for all other statistical analysis. Details of statistical tests are 
indicated in the figure legends. In box-and-whiskers plots the middle bar is the 
median, the hinges show the interquartile range (IQR) and the notches extend to 
± 1.58 IQR/sqrt(n), where n is the number of samples.
METHODS


Zebrafish stocks. Zebrafish (Danio rerio) strains were maintained and staged as 
described previously*'. Embryos were raised in E3 buffer at 30 °C, unless otherwise 
specified. The following mutant and transgenic strains were used: cxcr4b^t26035 (ref. 18),
cxcl12a^t30516 (ref. 20), cldnb:lyn-GFP”, cxcr7^sa16 (ref. 19), cxcr4a^um20 (ref. 32).
GAL4-UAS:mCherry (ETL GA346) line was provided by R. Köster’'. BAC transgenic lines
are described below. Other transgenic lines were made using the Tol2kit’’. All 
zebrafish experiments were conducted on embryos before 3 days post-fertilization, 
under the rules of the European Molecular Biology Laboratory and the guidelines 
of the European Commission, Directive 2010/63/EU. 

mRNA synthesis and plasmids list. Capped mRNA was synthesized using 
mMessageMachine kit (Ambion). Used plasmids, generated as multisite-gateway 
clones (Invitrogen) using the Tol2kit*, are listed below (in brackets the combined 
entry clones are indicated). The 6×UAS promoter was kindly provided by R. Köster.
Cxcr4b and Cxcr4b* constructs were a gift from the E. Raz laboratory (Cxcr4b* 
corresponds to construct 6 in ref. 16). Cxcr7* was created by removing the C- 
terminal 40 amino acids from Cxcr7 complementary DNA”’, following the stra- 
tegy used in ref. 34 (Extended Data Fig. 7). Plasmids list is included here, where 
NBT = Xenopus neural-specific β-tubulin promoter” (corresponds to nerve promoter),
mem = membrane tethering myristoylation sequence from Lyn kinase,
T2A = viral 2A peptide, CAAX = CAAX prenylation sequence:
SP6:Cxcr4b-TagRFP-sfGFP (p5'E(SP6/CMV)/pME(Cxcr4b noStop)/p3'E(TagRFP-sfGFP)),
6×UAS:Cxcr4b-GFP (p5'E(6xUAS)/pME(Cxcr4b noStop)/p3'E(GFP)),
6×UAS:Cxcr4b*-GFP (p5'E(6xUAS)/pME(Cxcr4b* noStop)/p3'E(GFP)),
6×UAS:Cxcr7-GFP (p5'E(6xUAS)/pME(Cxcr7 noStop)/p3'E(GFP)),
6×UAS:Cxcr7*-GFP (p5'E(6xUAS)/pME(Cxcr7* noStop)/p3'E(GFP)),
hsp70:Cxcl12a (p5'E(hsp70)/pME(MCS)/p3'E(Cxcl12a)),
nerve:Cxcr7-GFP (p5'E(NBT)/pME(Cxcr7 noStop)/p3'E(GFP)),
nerve:Cxcr4b-GFP (p5'E(NBT)/pME(Cxcr4b noStop)/p3'E(GFP)),
nerve:mem-GFP (p5'E(NBT)/pME(mem noStop)/p3'E(GFP)),
nerve:Cxcr7 (p5'E(NBT)/pME(Cxcr7)/p3'E(polyA)),
hsp70:Cxcl12a-GFP_T2A_mKate2-CAAX (p5'E(hsp70)/pME(Cxcl12a-GFP)/p3'E(T2A_mKate2-CAAX)).

BAC transgenesis. Four BACs were generated by modification of Cxcr4b BAC clone 
CH211-145M5 using the ET recombineering system” (Gene Bridges). The following 
selectable targeting cassettes were used for recombination. Targeting cassettes: (1)
Lyn-tdTomato/late SV40 poly(A) signal/FRT-KanR-FRT; (2) TagRFP(noStop)-
(GlyAla);-sfGFP/late SV40 poly(A) signal/FRT-KanR-FRT; (3) Lyn-TagRFP
(noStop)-(GlyAla);-sfGFP/late SV40 poly(A) signal/FRT-KanR-FRT; (4) cry:CFP/
KanR; (5) GFP/late SV40 poly(A) signal/FRT-KanR-FRT (in which cry = γ-crystallin
promoter®* and CFP = cyan fluorescent protein).

The following primers, encoding for left and right homology arms, were used 
to amplify the listed selectable targeting cassettes (indicated in brackets). The 
sequence homologous to the PCR template is underlined. 

Forward primers (left homology arms): (1) 5’-GCGTGTCTCAGGGAACGGA 
GGACTATTTAAATGCTGCTGTGAGAGGAGGGCGGTTGACTGAAGTGTG 
GAGACTTATTGCGCCTTTTTGAGCGAGCCTCTAGAGCCACCATG-3’; (2) 
5'-GATGCTGACCAAGAAAAGGGGGCCTATATCATCTGTATCTACTGAA 
TCAGAGTCGTCCAGTGCACTGACGAGTATGGTGTCTAAGGGCGAAG-3’; 
(3) 5'-GCGTGTCTCAGGGAACGGAGGACTATTTAAATGCTGCTGTGAGA 
GGAGGGCGGTTGACTGAAGTGTGGAGACTTATTGCGCCTTTTTGAGCA 
AGTTTGTACAAAAAAGCAGGCT-3’; (4) 5'-CGCGGGGCATGACTATTGG 
CGCGCCGGATCGATCCTTAATTAAGTCTACTAATTATGATCCTCTAGAT 
CAGATCT-3’; (5) 5'-TCCAGTCACAAGATGCTGACCAAGAAAAGGGGGCC
TATATCATCTGTATCTACTGAATCAGAGTCGTCCAGTGCACTGACGAGT 
ATGGTGAGCAAGGGCGAGGAG-3’. 

Reverse primers (right homology arms): (1,5) 5’-CATGATACACTTAAACTA 
GTCTTATTAATAATACTTATCAATGGAATGTCTGATAACACAGCGTTAT 
AAATTAATCACAAGCCAACTTACCTAAAGGGAACAAAAGCTGGGTACC-3’; 
(2) 5’-ACACAAAAATACTTTACAATGTACAAAAACTGTAGTAAAGTCCT 
GTTTTTATAAGCTTAATCATCCATGTGGGCACCCGTGGCCGTATCT-3’; 
(3) 5’-CATGATACACTTAAACTAGTCTTATTAATAATACTTATCAATGG 
AATGTCTGATAACACAGCGTTATAAATTAATCACAAGCCAACTTACGC 
ACCCGTGGCCGTATCT-3’; (4) 5’-GCGTAAGCGGGGCACATTTCATTACC 
TCTTTCTCCGCACCCGACATAGATATTACCCTGTTATCCCTAGAAACAG 
CTATGACCATGTAA-3’. 

For cxcr4b:NLS-tdTomato, cassette 1 was inserted after cxcr4b first intron, 
leaving the first 5 amino acids encoded by the first exon, which when fused to 
the membrane-tethering myristoylation sequence from Lyn kinase gave rise to a 
nuclear localization sequence (NLS). cxcr4b:cxcr4b-tFT (receptor C-terminal 
fusion) and cxcr4b:mem-tFT (receptor transcriptional reporter, a tFT tethered to 
the membrane by the mem sequence) were generated in two steps. First, cassettes 2 
and 3 were inserted to replace Cxcr4b STOP and ATG starting codons, respec- 
tively. Then, cry:CFP transgenic marker (cassette 4) was introduced into the 
pTARBAC2.1 backbone. For cxcr4b:cxcr4b-GFP (receptor C-terminal fusion) 


BAC line, the Cxcr4b STOP codon was replaced by cassette 5. The FRT-flanked
kanamycin-resistance cassette (FRT-KanR-FRT) was removed with FLP recombinase in all
modified BACs. The inserted fragments and the regions flanking the recombina- 
tion sites were sequenced before purification of positive clones (Large Construct 
Kit, Qiagen). Embryos were injected at one-cell stage, raised to adulthood and 
screened for fluorescence expression. 

cxcl12a knockdown. Previously described cxcl12a morpholino MO3 (ref. 12) (5'- 
CTACTACGATCACTTTGAGATCCAT-3’, Gene Tools) was diluted to 1 mM 
and injected at one-cell stage. Injection efficacy was monitored by scoring the 
number of arrested primordia in injected wild-type control embryos. 

Analysis of Cxcr4b-tFT lifetime ratio in early embryos. cxcr4b-tFT mRNA 
(200 ng µl⁻¹, drop size ~0.6 nl) was injected alone or in the presence of titrating
concentrations of cxcl12a mRNA (10, 25 and 50 ng µl⁻¹) in one-cell stage embryos.
Dechorionated 7 h.p.f. embryos were mounted in 0.8% low melting point (LMP) 
agarose and high-resolution confocal images were acquired as described in the section 
‘Imaging of tFT reporters’. After background subtraction, membrane fragments 
were manually defined using the freehand line tool in Fiji (line width = 0.9 µm)
followed by calculation of the mean red and green fluorescence intensities. Red/ 
green ratios were normalized by the ratio of the sfGFP-mCherry reference solution 
and averaged per embryo. One embryo represents one data point. 
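
The per-embryo calculation described above can be sketched as follows. This is an illustrative outline only, not the authors' code: the function name and the input layout (one pair of background-subtracted mean red and green intensities per membrane fragment) are assumptions made for the example.

```python
from statistics import mean

def embryo_lifetime_ratio(fragments, reference_red, reference_green):
    """Average normalized red/green ratio for one embryo.

    fragments: list of (mean_red, mean_green) pairs, one per manually
    defined membrane fragment, already background-subtracted.
    reference_red, reference_green: intensities of the reference solution
    acquired under the same imaging conditions.
    """
    reference_ratio = reference_red / reference_green
    # Normalize each fragment ratio by the reference, then average per embryo.
    return mean((red / green) / reference_ratio for red, green in fragments)
```

Each returned value corresponds to one data point, matching the statement that one embryo represents one data point.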

Imaging of tFT reporters. Embryos were mounted in 1% LMP agarose in glass-bottom
dishes (MatTek Corporation) and imaged on a Zeiss LSM 780 confocal microscope
at 30 °C. High-resolution 16-bit z-stacks of primordia (pixel size = 0.131 µm,
z-space = 0.5 or 1 µm) were captured using a 63× water immersion objective
(C-APOCHROMAT, 1.2 NA, Zeiss). Green and red signals (green: ex. 488 nm,
em. 489-550 nm; red: ex. 561 nm, em. 568-664 nm, A.U. = 1) were acquired
sequentially by line using Gallium-Arsenide-Phosphide (GaAsP) detectors. A reference
solution of purified mCherry-sfGFP fusion protein’’ was used to normalize
for intensity fluctuations affecting the ratio. Low-resolution 16-bit z-stacks of the
entire embryo (z-space = 10 µm, pixel size = 0.83 µm) were acquired with a 20×
objective (EC Plan-NEOFLUAR, 0.5 NA, Zeiss). Red and green images correspond
to merged Gaussian filtered and flattened (sum projection) dual colour z-stacks, 
whereas ratiometric images were generated as described in the section ‘Generation 
of ratiometric images’. 

Image analysis of lifetime ratios. The complete image analysis code is available as 
part of the Bioconductor data package DonaPLLP2013 (http://www.bioconductor.org). 
Blind sample selection was not possible, as mutant phenotypes were easily iden- 
tifiable; however, specimens of each genotype were randomly selected from age- 
matched pools and lifetime ratios were extracted using an automated analysis 
pipeline, preventing bias in assessing the outcome of each experiment. In brief, 
z-stacks of primordia were acquired as described above and corrected for back- 
ground. The green channel was chosen as the reference channel owing to the fast 
maturation kinetics of sfGFP and, from this, a binary mask of the primordium 
membrane (membrane-mask) was obtained by smoothing and adaptive thresh- 
olding. Then, a binary tissue-mask was computed and the axis of migration, along 
which all samples were aligned, was extracted using principal component analysis. 
Only pixels lying within the intersection of membrane-mask and tissue-mask 
(tissue-membrane-mask) were considered for subsequent calculations. To reduce 
the effects of pixel noise and chromatic aberration, a three-dimensional running 
median across the tissue (cube side-length of 10 µm) was computed in both channels.
Red/green fluorescence intensity ratios were then calculated and normalized
using a mCherry-sfGFP” reference solution acquired with the same imaging 
conditions. This procedure was necessary to compensate for day-to-day fluctua- 
tions of the instrument and permitted direct comparison of samples imaged on 
different sessions. To obtain tissue-scale lifetime ratios we calculated the median of 
normalized red/green fluorescence intensity ratios within the tissue-membrane- 
mask. Alternatively, to obtain spatially resolved lifetime ratio profiles, the median 
ratio along the axis of migration x (that is, through yz) was computed. 
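
The final two steps of this pipeline, the tissue-scale median and the profile along the axis of migration, can be sketched as below. This is a simplified illustration and not the published DonaPLLP2013 code, which is written in R: masking, alignment and the running median are assumed to have been done already, the voxel tuple layout and function names are hypothetical, and the 1 µm bin width is an arbitrary choice for the example.

```python
from collections import defaultdict
from statistics import median

def normalized_ratios(voxels, reference_ratio):
    """voxels: iterable of (x_um, red, green) inside the tissue-membrane-mask."""
    return [(x, (red / green) / reference_ratio)
            for x, red, green in voxels if green > 0]

def tissue_scale_ratio(voxels, reference_ratio):
    """Median normalized red/green ratio over all masked voxels."""
    return median(r for _, r in normalized_ratios(voxels, reference_ratio))

def spatial_profile(voxels, reference_ratio, bin_um=1.0):
    """Median ratio per position bin along x (pooling over y and z)."""
    bins = defaultdict(list)
    for x, ratio in normalized_ratios(voxels, reference_ratio):
        bins[int(x // bin_um)].append(ratio)
    return {index * bin_um: median(values)
            for index, values in sorted(bins.items())}
```

Using the median rather than the mean at both scales keeps the summary robust to the few very bright or very dim voxels that survive masking.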
Generation of ratiometric images. From the image analysis we obtained back- 
ground subtracted, Gaussian smoothed and membrane-masked z-stacks of the 
primordium in both red and green channels, which were further processed with 
Fiji. Stacks were flattened using sum projection, red/green ratio images were 
created using the Image Calculator plugin. These were then further normalized 
by dividing by the ratio of the reference solution. For visualization, images were 
colour-coded using the ratio lookup table in Fiji. 
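
The ratiometric image steps were performed in Fiji; the following sketch only restates the arithmetic (sum projection, pixel-wise red/green division, normalization by the reference ratio) in plain Python. The nested-list image representation and the use of None for pixels with no green signal are assumptions made for this example.

```python
def sum_projection(stack):
    """Collapse a z-stack (list of 2D row-major planes) by summing over z."""
    height, width = len(stack[0]), len(stack[0][0])
    return [[sum(plane[y][x] for plane in stack) for x in range(width)]
            for y in range(height)]

def ratio_image(red_stack, green_stack, reference_ratio):
    """Normalized red/green image; masked-out pixels (green == 0) become None."""
    red = sum_projection(red_stack)
    green = sum_projection(green_stack)
    return [[(r / g) / reference_ratio if g > 0 else None
             for r, g in zip(red_row, green_row)]
            for red_row, green_row in zip(red, green)]
```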

Statistical analysis. Statistical analysis of lifetime ratios was performed using R. 
The data set and vignette required to reproduce the analysis with runnable R code 
are available as part of the Bioconductor data package DonaPLLP2013 (http:// 
www.bioconductor.org). To generate the analysed data set, consisting of normal- 
ized tissue-scale lifetime ratios obtained as described in the section ‘Image analysis 
of lifetime ratios’, embryos were carefully staged and always imaged at the same 
developmental stage (32 h.p.f.) to ensure comparability. Moreover, a calibration 
solution of purified mCherry-sfGFP fusion protein’” was used to normalize for 
variability in imaging conditions. As the effect size was not known a priori, we ran
initial samples to obtain a first estimate of it. From these experiments we estimated 
that approximately n > 20 samples would be sufficient to obtain enough statistical 
power; however, we imaged sample sizes of up to n = 46, from at least three 
technical replicates per condition. Subsequent statistical testing demonstrated high 
statistical power owing to these large sample sizes. The distribution of all tissue- 
scale ratios is shown in Extended Data Fig. 10a, allowing inspection of the variation 
between samples. Consistency with assumptions of normal distribution of the data 
was assessed by generation of quantile-quantile (Q-Q) plots for each condition 
individually (Extended Data Fig. 10b). These indicated that the data were suffi- 
ciently close to being normally distributed, therefore two-sided Welch’s t-tests for 
each of the comparisons of interest and multiple testing correction across these 
comparisons with Bonferroni’s method were performed. An alternative, non- 
parametric test, the two-sided Mann-Whitney test (a two-sample Wilcoxon test), 
returned comparable results leading to the same biological interpretation of the 
data (Extended Data Fig. 10c). 
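
For orientation, the quantities behind a Welch's t-test and a Bonferroni adjustment can be written out as below. This is a standard-library sketch of the textbook formulas and not the authors' R analysis; converting the statistic to a P value additionally requires the Student t distribution, which is deliberately left out here.

```python
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    va = variance(sample_a) / len(sample_a)  # squared standard error, group a
    vb = variance(sample_b) / len(sample_b)  # squared standard error, group b
    t = (mean(sample_a) - mean(sample_b)) / (va + vb) ** 0.5
    df = (va + vb) ** 2 / (va ** 2 / (len(sample_a) - 1)
                           + vb ** 2 / (len(sample_b) - 1))
    return t, df

def bonferroni(p_values):
    """Multiply each P value by the number of comparisons, capped at 1."""
    return [min(1.0, p * len(p_values)) for p in p_values]
```

Unlike Student's t-test, the Welch form does not assume equal variances between genotypes, which is why the degrees of freedom are estimated from the two sample variances.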

GraphPad Prism 4.0 (GraphPad Software) was used for all other statistical 
analysis. Details of statistical tests are indicated in the figure legends. In box- 
and-whiskers plots the middle bar is the median, the hinges show the interquartile 
range (IQR) and the notches extend to ± 1.58 IQR/sqrt(n), where n is the number
of samples. 
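
The notch definition above amounts to the following calculation. The sketch uses Python's inclusive quartiles as a stand-in for the hinges; plotting software may compute hinges slightly differently, so small numerical differences from the published plots would be expected.

```python
from statistics import median, quantiles

def notch_limits(values):
    """Return (lower, upper) notch limits: median -/+ 1.58 * IQR / sqrt(n)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    half_width = 1.58 * (q3 - q1) / len(values) ** 0.5
    centre = median(values)
    return centre - half_width, centre + half_width
```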

Heat-shock induction of Cxcl12a. Embryos were raised and imaged at 28 °C to 
avoid leakage of the hsp70 promoter and heat-shocked at 38 °C in a water bath for 
30 min if not indicated otherwise. Imaging was performed as described in the 
section ‘Imaging of tFT reporters’. For time-lapse experiments four tiles were 
acquired and the area corresponding to the primordium was manually defined 
for further analysis. A time interval of approximately 1 h was chosen to minimize
photobleaching and allow multi-position acquisition. 

Mosaic expression of Cxcl12a-GFP. Embryos were injected at one-cell stage with
hsp70:cxcl12a-GFP_T2A_mKate2-CAAX and raised at 28 °C. At 32 h.p.f., embryos were
heat-shocked at 38 °C for 1 h, mounted as described and imaged on a Zeiss LSM
780 confocal microscope (63× objective, GaAsP detector, pixel size = 0.131 µm).
Owing to low intensity of the GFP signal the pinhole was opened to 2 a.u. Stacks
were flattened by maximum intensity projection for visualization and by sum
projection to calculate mean fluorescence intensities across the primordium midline
(line width = 7.2 µm).

Immunostaining. Whole-mount antibody staining was performed following 
standard procedures*'. Monoclonal anti-Cxcr7 and anti-Cxcr4b (zebrafish) antibodies
were generated by the EMBL Monoclonal Antibody Facility using the following
peptide sequences as antigens: MSVNVNDFNDILDALGELNFC and
MEFYDSIILDNSSDSGSGDYDGEELC, corresponding to the amino terminus of Cxcr7 and
Cxcr4b receptors, respectively. Antibody specificity was confirmed by demonstrating
a lack of immunostaining pattern in embryos mutant for the respective receptor.
Rabbit-anti GFP (Torrey Pines Biolabs) was used for staining of Cxcl12a-GFP.
Goat anti-mouse Alexa Fluor 647 and goat anti-rabbit Alexa Fluor 488 (Molecular 
Probes) were used as secondary antibodies. Imaging was performed on either a 
Zeiss LSM 780 confocal microscope as described or a PerkinElmer Improvision 
Ultraview VoX Spinning disk confocal. In this case, a 63× water immersion
objective (C-APOCHROMAT, 1.2 NA, Zeiss, pixel size = 0.106 µm, z-space = 0.5 µm)
and the following imaging settings were used: sequential acquisition; dichroic:
multi pass 405/488/568/640; green: ex. 488 nm, em. single BP 525(W50); far red:
ex. 640 nm, em. double BP 485(W60), 705(W90). Images were smoothed by
Gaussian filter and flattened using maximum intensity projection in Fiji. 
Time-lapse imaging and primordium migration analysis. Time-lapse experiments
for the analysis of primordium migration were carried out at 30 °C on a
PerkinElmer Improvision Ultraview VoX Spinning disk confocal using a 25×
multi-immersion objective (Plan-APOCHROMAT, 0.8 NA, Zeiss). Image analysis was
performed in Fiji. z-stacks spanning the whole primordium (z-space = 1 µm) were
flattened by maximum intensity projection and stitched using the Pairwise
Stitching plugin’? where necessary. Kymographs were generated using the
Kymograph plugin (line width = 2 µm). When GFP fluorescent nerve was present,
the furthermost axon growth cone was manually indicated with a point on the 
migration path before creating the kymograph. 

Time-lapses for the analysis of primordium migration speed (Fig. 4c) were 
3-5 h long (time interval = 10 min). Primordium and nerve tips were tracked
using the Manual Tracking plugin. Primordium tip instantaneous velocities were 
extracted (directionality: plus = anterior to posterior, minus = posterior to anterior) 
and binned depending on the distance between the tip of Cxcr7-GFP-positive 
nerve and the leading edge of the primordium at that time-point (binning win- 
dow = 25 1m). A minus sign was assigned to the distance when the nerve tip was in 
front of the primordium. Sixteen movies per condition (wild type or wild type plus 
nerve:Cxcr7) were analysed. 
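
The velocity binning described above can be outlined as follows. This is a hypothetical sketch rather than the authors' procedure, which used the Manual Tracking plugin in Fiji: positions are assumed to be one-dimensional coordinates along the migration path, each velocity is paired with the nerve-to-primordium distance at the start of its interval, and bins are labelled by their lower edge.

```python
from collections import defaultdict
from statistics import mean

def instantaneous_velocities(positions_um, interval_min=10.0):
    """Velocity (um per min) between consecutive tracked tip positions."""
    return [(after - before) / interval_min
            for before, after in zip(positions_um, positions_um[1:])]

def bin_by_distance(velocities, distances_um, window_um=25.0):
    """Mean velocity per distance bin; negative distance = nerve tip ahead."""
    bins = defaultdict(list)
    for velocity, distance in zip(velocities, distances_um):
        lower_edge = (distance // window_um) * window_um
        bins[lower_edge].append(velocity)
    return {edge: mean(values) for edge, values in sorted(bins.items())}
```

Positive velocities correspond to anterior-to-posterior movement under the sign convention given above, provided positions increase towards the posterior.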

Overviews of 48 h.p.f. embryos were acquired using a Leica MZ16F fluorescence 
stereomicroscope. Distances from the ear to the primordium and from the ear to 
the tail of the fish were measured using Fiji and their ratio was calculated. 
LysoTracker Red DND-99 staining. Dechorionated embryos were incubated for 
90 min in 1 µM LysoTracker Red DND-99 dye (Molecular Probes) in fish E3
medium plus 1% dimethylsulphoxide (DMSO). 

Bafilomycin A1 inhibition of lysosomal function. Dechorionated embryos were
incubated in E3 medium plus Bafilomycin A1 (Sigma) at a final concentration of
300 nM for 3 h or 3 µM for 40 min. Embryos were then either fixed in 4% paraformaldehyde
or mounted in 1% LMP agarose and imaged immediately.
Quantification of endogenous Cxcr4b degradation in response to Cxcl12a
overexpression. cldnb:lyn-GFP/hsp70:cxcl12a double-transgenic embryos were 
heat-shocked for 1h at 38 °C and subsequently incubated for 3 h in the presence 
of Bafilomycin A1 at a final concentration of 300 nM or 0.01% DMSO (control). 
Control embryos (cldnb:lyn-GFP without hsp70:cxcl12a) were also heat-shocked. 
Embryos were then fixed in 4% paraformaldehyde and stained for Cxcr4b using 
the described monoclonal anti-Cxcr4b antibody. z-stacks spanning the whole 
primordium were acquired on a Zeiss LSM 780 confocal microscope. After back- 
ground subtraction (rolling ball algorithm in Fiji) images were flattened using sum 
projection and Cxcr4b total fluorescence intensity was calculated within a prim- 
ordium mask defined using the Lyn—GFP counter-label. 
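The last two steps of this quantification (sum projection, then total intensity inside a mask) can be sketched as follows. This is an illustration only: it assumes the stack is already background-subtracted, represents images as nested lists rather than real microscope data, and uses hypothetical function names. The rolling ball step performed in Fiji is not reproduced here.

```python
def sum_projection(stack):
    """Collapse a z-stack (list of equally sized 2D planes) by summing along z."""
    rows, cols = len(stack[0]), len(stack[0][0])
    return [[sum(plane[r][c] for plane in stack) for c in range(cols)]
            for r in range(rows)]


def total_intensity_in_mask(image, mask):
    """Sum the pixel values of a 2D image wherever the mask is True."""
    return sum(value
               for image_row, mask_row in zip(image, mask)
               for value, inside in zip(image_row, mask_row)
               if inside)


# Hypothetical 2-plane stack and primordium mask (2 x 2 pixels)
stack = [
    [[1, 2], [3, 4]],
    [[10, 20], [30, 40]],
]
mask = [[True, False], [True, True]]

projected = sum_projection(stack)
total = total_intensity_in_mask(projected, mask)
```

A sum projection, unlike a maximum intensity projection, preserves the total signal along z, which is why it suits a measurement of total fluorescence.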

Analysis of Cxcr4b distribution across the primordium. cldnb:lyn-GFP 
transgenic embryos were fixed at 32 h.p.f. and stained for Cxcr4b using the 
aforementioned monoclonal anti-Cxcr4b antibody. Confocal stacks were acquired on 
a Zeiss LSM 780 as already described for tFT reporter imaging. The same pipeline 
used for the ratiometric analysis of Cxcr4b-tFT was followed, with two differences: 
the intersection of the green and red individual masks was used as the membrane 
mask, and profiles were normalized by the mean ratio in the front 15 µm of the tissue. 
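The normalization described above, together with the median and MAD summary used in the corresponding plots, can be sketched as below. The sketch assumes each profile is a list of ratio values sampled at known distances from the leading edge; the function names and numbers are hypothetical.

```python
from statistics import mean, median

FRONT_WINDOW_UM = 15.0  # front-most region used for normalization in the text


def normalize_profile(positions_um, ratios, front_window=FRONT_WINDOW_UM):
    """Divide a ratio profile by its mean value within the front window.

    positions_um are distances from the leading edge (0 = front).
    """
    front = [r for x, r in zip(positions_um, ratios) if x <= front_window]
    reference = mean(front)
    return [r / reference for r in ratios]


def median_and_mad(values):
    """Median and median absolute deviation (MAD) of a list of values."""
    values = list(values)
    med = median(values)
    mad = median(abs(v - med) for v in values)
    return med, mad


# Hypothetical profile sampled every 5 µm from the leading edge
positions = [0, 5, 10, 15, 20, 25]
ratios = [2.0, 2.0, 2.0, 2.0, 3.0, 4.0]
normalized = normalize_profile(positions, ratios)

# Hypothetical values of several normalized profiles at one position
med, mad = median_and_mad([1.0, 2.0, 4.0])
```

Normalizing each profile to its own front region lets profiles from samples with different overall staining intensity be compared on one scale.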
Cxcr4a's role in primordium migration. Whole-mount in situ hybridization of 
Cxcr4a was performed using standard procedures. Cxcr4a probe and cxcr4a^um20 
(ref. 32) were provided by A.F. Siekmann. cxcr4b:NLS-tdTomato transgenic fish 
carrying the cxcr4a^um20 allele were in-crossed and primordium migration analysis 
was performed as already described. 
Extended Data Figure 1 | cxcr4b:cxcr4b-tFT BAC rescues primordium 
migration defect in cxcr4b mutants. a, Image of cxcr4b mutant embryos, 
where primordium migration (white arrowheads) is made visible by 
cxcr4b:NLS-tdTomato (red). Fish carrying cxcr4b:cxcr4b-tFT transgenes are 
identifiable by an independent transgenic marker that labels the lens blue 
(cry:CFP, asterisks). b, In all cxcr4b:cxcr4b-tFT transgenic embryos the 
primordium migrated normally, whereas non-transgenics showed migration 
defects typical of cxcr4b mutants. Dot-plot, median and interquartile range of 
the indicated number of samples are shown (Welch's t-test, ***P < 0.0001). 
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Extended Data Figure 2 | Cxcl12a-induced internalization targets Cxcr4b 
to lysosomal degradation. a-c, Cxcr4b is sent for lysosomal degradation in 
wild-type embryos. a, cxcr4b:cxcr4b-GFP embryo stained with the lysosomal 
marker LysoTracker Red DND-99. Left, maximum intensity projection of a 
primordium confocal z-stack (scale bar, 20 µm) showing Cxcr4b-GFP (green) 
and LysoTracker (red) localization. Right, magnification of single confocal 
plane showing LysoTracker and Cxcr4b-GFP co-localization (arrowheads; 
scale bar, 5 µm). b, Impaired lysosomal acidification leads to accumulation of 
endogenous Cxcr4b in intracellular vesicles. Anti-Cxcr4b (maximum intensity 
projections) of Bafilomycin A1-treated embryos (bottom) and controls (top; 
DMSO (dimethylsulphoxide)). c, Bafilomycin A1 treatment results in an 
increased number of Cxcr4b-tFT vesicles. Images show maximum intensity 
projections of z-stacks (scale bar, 20 µm). d-f, High Cxcl12a levels cause 
increased degradation of endogenous Cxcr4b. d, Anti-Cxcr4b 
immunohistochemistry 3 h after heat-shock induction (1 h at 38 °C) of 
hsp70:cxcl12a (sum projection of confocal z-stacks). e, Quantification of Cxcr4b 
levels within the primordium (box-and-whiskers plot, one-way ANOVA test 
with Bonferroni's multiple comparison adjustment, ***P < 0.001) showing 
significantly reduced levels in hsp70:cxcl12a transgenics compared to 
non-transgenic controls. This effect is suppressed by Bafilomycin A1 treatment, 
resulting in increased accumulation of Cxcr4b-positive intracellular vesicles 
(f). Insets show single plane magnification of the squared area. Scale bars, 
20 µm. NS, not significant. 


©2013 Macmillan Publishers Limited. All rights reserved 


[Extended Data Figure 3, panels a-c (plots not reproduced): a, sfGFP; b, TagRFP; 
c, TagRFP/sfGFP ratio of Cxcr4b-tFT. Each panel is plotted against distance 
from leading edge (µm) at t = 0 and t = 3 h; n = 7.] 

Extended Data Figure 3 | Cxcr4b-tFT lifetime ratio is stable over several 
hours. a-c, Spatially resolved profiles of Cxcr4b-tFT-expressing primordia at 
32 h.p.f. (t = 0) and 35 h.p.f. (t = 3 h) show that although sfGFP (a) and 
TagRFP (b) median fluorescence intensities increase through time, their ratio 
instead stays constant and maintains a gradient across the front-rear axis of the 
tissue (c). Median and MAD of 7 samples acquired on the same day with 
identical imaging conditions are shown. d, Ratiometric images of one 
representative sample through time. Lookup table = red/green fluorescence 
intensity ratio. Scale bar, 20 µm. All lifetime measurements were performed 
during this time window, whenever possible. 




Extended Data Figure 4 | Increased turnover in primordium leading 
domain results in graded distribution of endogenous Cxcr4b receptor. 
Anti-Cxcr4b immunohistochemistry reveals a gradient of endogenous receptor 
along the primordium. a, Cxcr4b antibody (red) and a homogeneous 
membrane counter-label given by cldnb:lyn-GFP transgenic line (green) 
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displayed in segmented (top, sum projection of segmented stack) and 
ratiometric form (bottom, calibration bar = Cxcr4b antibody/Lyn-GFP 
fluorescence intensity ratio). b, Cxcr4b antibody/Lyn-GFP fluorescence 
intensity ratio profiles. Median and MAD after normalization of individual 
profiles by the front-most 15 µm are plotted. 
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Extended Data Figure 5 | Cxcr4b-tFT lifetime ratio gradient results from 
protein turnover not from a transcriptional gradient. a, Schematic 
representation of the modified BACs used to create Cxcr4b-tFT and mem-tFT 
reporter lines. b, Predicted behaviour of Cxcr4b-tFT and mem-tFT in response 
to Cxcl12a. c, d, Ratiometric images of 36 h.p.f. Cxcr4b-tFT (c) and mem-tFT 
(d) of whole-mount transgenic embryos. Segmentation was performed using 
intensity-based thresholding on sfGFP channel. Scale bars, 200 µm. Calibration 
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bar = normalized red/green ratio. e, f, Raw primordium images (merge of 
green and red channels) corresponding to ratiometric images shown in Fig. 2a, 
b. Scale bars, 20 µm. g, Ratiometric images of cxcr4b:mem-tFT before (top) and 
10 h after (bottom) heat-shock-induced pulse of Cxcl12a. Scale bars, 20 µm. 
h, Spatially resolved lifetime ratio analysis of the sample in g. Although overall 
the ratio increases, during the long time lag, no front to rear gradient could be 
detected, in contrast to the behaviour of Cxcr4b-tFT (compare to Fig. 2e, f). 
Extended Data Figure 6 | Cxcr7-mediated internalization of Cxcl12a-GFP 
at the tissue rear. a, Film-strip montages of z-stacks showing mosaic 
expression of Cxcl12a-GFP resulting in preferential internalization at the rear 
of the primordium (outlined by white line, corresponding to Supplementary 
Video 3). Stacks were acquired 1-2 h after heat-shock induction (1 h at 38 °C) 
of hsp70:cxcl12a-GFP_T2A_mKate2-CAAX-injected embryos. Accumulation 
of Cxcl12a-GFP-positive vesicles in the primordium (yellow arrows) and 
Cxcl12a-GFP-expressing cells within the primordium are indicated (white 
arrowheads). Scale bar, 20 µm. b, Anti-GFP and anti-Cxcr7 
immunohistochemistry showing co-localization of Cxcl12a-GFP (green) and 
Cxcr7 (red) in vesicles at the rear of the primordium. Images show maximum 
intensity projection. Scale bars, 10 µm. 
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Extended Data Figure 7 | Validation of internalization-defective 
chemokine receptor forms. a, Cartoon illustrating the Gal4/UAS strategy used 
to induce mosaic expression of chemokine receptors in the primordium. 
b, Schematic of internalization-defective Cxcr4b form (Cxcr4b*), as described in 
ref. 16. Mutated putative phosphorylation sites (S to A) are shown in orange. 
c, Examples of Gal4/UAS-driven expression of Cxcr4b-GFP (green or 
greyscale, top) or Cxcr4b*-GFP (green or greyscale, bottom) in 
cxcr4b:NLS-tdTomato (red) transgenic embryos. Both receptor forms localize at the plasma 
membrane (arrowheads). Higher magnification of the squared area shows 
internalized Cxcr4b (arrows), but not Cxcr4b*, under unstimulated conditions, 
presumably in response to endogenous Cxcl12a. Scale bars, 10 µm. 
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d, e, Heat-shock-induced Cxcl12a expression causes complete internalization 
of Cxcr4b-GFP (d), whereas Cxcr4b*-GFP remains at the plasma membrane 
(e). Thus, Cxcr4b* does not internalize Cxcl12a. f, An internalization-defective 
form of Cxcr7 (Cxcr7*) was created by deleting the 40 C-terminal amino acids 
of the receptor. Putative ubiquitination and phosphorylation target residues, 
conserved in humans, are shown in red and orange, respectively. g, GFP fusion 
of full-length Cxcr7 localizes in vesicles (arrows in the inset), consistent with 
antibody stainings of endogenous Cxcr7 (Fig. 3a). h, C terminus deletion results 
in clear receptor re-localization to the plasma membrane (arrowheads). Cxcr7* 
is only occasionally found in vesicles (arrows). Expression of both receptor 
forms was driven by Gal4/UAS. Scale bars, 50 µm (10 µm in the insets). 
Extended Data Figure 8 | Cxcr7-GFP misexpression alters migrating 
primordium morphology. a-c, Primordia at 36 h.p.f. (marked by 
NLS-tdTomato) in wild type (WT) conditions (a) or misexpressing Cxcr4b-GFP 
(b) or Cxcr7-GFP (c) under the control of the Gal4/UAS system. d, cxcr4b-/- 
primordium misexpressing Cxcr7-GFP (green). Scale bars, 10 µm. e, Overlay 
of outlines of several primordia in the same conditions as in a-c. Primordia are 
aligned by the leading edge (right). Ectopic Cxcr7-GFP, but not Cxcr4b-GFP, 
expression results in primordium elongation. This phenotype is Cxcr4b 
dependent, as elongated morphology is never observed in Cxcr4b mutants 
misexpressing Cxcr7-GFP (d). 
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Extended Data Figure 9 | Cxcr4a is not required for primordium migration. 
a, In situ hybridization reveals cxcr4a expression in the primordium (36 h.p.f. 
embryo). Inset shows higher magnification of the squared area comprising the 
primordium. b, Images show primordium migration (red = cxcr4b:NLS- 
tdTomato) at 48 h.p.f. in the presence (top) or absence (bottom) of Cxcr4a. 
Asterisk marks heart oedema, characteristic of cxcr4a-/- (corresponding to 
cxcr4a^um20/um20; ref. 32). c, Quantification of primordium migration in cxcr4a-/- 
embryos. The absence of Cxcr4a does not affect migration when Cxcr4b is 
present. Dot-plot, median, interquartile range and number of analysed samples 
are shown. Unpaired t-test. 
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Extended Data Figure 10 | Statistical analysis of tissue-scale lifetime ratios. 
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corresponding to Figs 2d and 3c). b, Quantile-quantile (Q-Q) plots of the 
analysed data sets. c, Statistical tests results. 
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A high-resolution map of the three-dimensional 
chromatin interactome in human cells 


Fulai Jin, Yan Li, Jesse R. Dixon, Siddarth Selvaraj, Zhen Ye, Ah Young Lee, Chia-An Yen, Anthony D. Schmitt, 
Celso A. Espinoza & Bing Ren 


A large number of cis-regulatory sequences have been annotated in 
the human genome, but defining their target genes remains a 
challenge. One strategy is to identify the long-range looping 
interactions at these elements with the use of chromosome conformation 
capture (3C)-based techniques. However, previous studies 
lack either the resolution or coverage to permit a whole-genome, 
unbiased view of chromatin interactions. Here we report a 
comprehensive chromatin interaction map generated in human fibroblasts 
using a genome-wide 3C analysis method (Hi-C). We determined 
over one million long-range chromatin interactions at 5-10-kb 
resolution, and uncovered general principles of chromatin 
organization at different types of genomic features. We also characterized 
the dynamics of promoter-enhancer contacts after TNF-α 
signalling in these cells. Unexpectedly, we found that TNF-α-responsive 
enhancers are already in contact with their target promoters before 
signalling. Such pre-existing chromatin looping, which also exists in 
other cell types with different extracellular signalling, is a strong 
predictor of gene induction. Our observations suggest that the 
three-dimensional chromatin landscape, once established in a particular 
cell type, is relatively stable and could influence the selection or 
activation of target genes by a ubiquitous transcription activator in a 
cell-specific manner. 

We carried out Hi-C experiments to study the dynamic chromatin 
interactions in primary human fibroblast cells (IMR90) in response 
to transient TNF-α signalling. Hi-C data from IMR90 cells before and 
after 1 h TNF-α treatment were combined, to produce a total of 
approximately 3.4 billion uniquely mapped paired-end reads from 6 biological 
replicates in each condition, among which approximately 1.4 billion 
are intra-chromosomal reads (Supplementary Tables 1 and 2). To 
accurately identify chromatin looping interactions with high 
sensitivity and resolution, we devised an improved data filtering strategy 
based on the strand orientation of Hi-C paired-end reads 
(Supplementary Figs 1-6 and Supplementary Methods), which results in over 500 
million high-confidence read pairs (Supplementary Tables 1 and 2), 
each representing a legitimate ligation event between two restriction 
fragments on the same chromosome. As we recognized that some reads 
may be due to random collision events between restriction fragments, 
we also estimated the expected contact frequency between any two restriction 
fragments, and then fitted a negative binomial model to assess the 
significance of the observed contact frequency (Supplementary Methods 
and Supplementary Figs 7-9). Compared to previous methods, our 
data analysis method permits detection of chromatin interactions at 
short distance. For example, we observed asymmetric distribution of 
cis-contacts from highly expressed promoters to the immediate 
downstream gene bodies (Supplementary Fig. 10). This observation is 
reminiscent of a recent study showing interactions between a subset 
of exons and their promoters. Interestingly, although such bias at 
promoters is correlated with elongation of RNA polymerase II, it 
remains when transcription elongation is blocked by the positive 
transcription elongation factor b (P-TEFb) inhibitor flavopiridol 
(Supplementary Fig. 11), suggesting that the maintenance of 
promoter-gene-body contacts is independent of active transcription. 
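To make the significance test concrete, the sketch below computes an upper-tail P value for an observed contact count under a negative binomial distribution with a given expected count. It is an illustration under stated assumptions, not the fitting procedure of the Supplementary Methods: the mean/dispersion parameterization, the function names and the example numbers are all hypothetical, and the dispersion is taken as given rather than fitted.

```python
import math


def nb_log_pmf(k, mu, size):
    """Log-probability of count k under a negative binomial distribution.

    Parameterized by the mean `mu` (expected contact count, > 0) and the
    dispersion `size` (> 0); variance is mu + mu**2 / size.
    """
    p = size / (size + mu)
    return (math.lgamma(k + size) - math.lgamma(size) - math.lgamma(k + 1)
            + size * math.log(p) + k * math.log(1.0 - p))


def nb_upper_tail(k_obs, mu, size):
    """P(X >= k_obs): chance of seeing at least the observed count."""
    lower = sum(math.exp(nb_log_pmf(i, mu, size)) for i in range(k_obs))
    return max(0.0, 1.0 - lower)


# Hypothetical fragment pair: 12 reads observed where 2.5 are expected
p_value = nb_upper_tail(12, mu=2.5, size=3.0)
```

A negative binomial allows the variance to exceed the mean, which a Poisson model would not; this overdispersion is typical of sequencing count data.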

To accurately map at high resolution the chromatin interactions 
genome-wide, we devised an algorithm (Supplementary Fig. 12) to 
identify statistically significant looping interactions centred on a given 
genomic region from the Hi-C contact matrix (Fig. 1a). We applied this 
method to the CCL2 (chemokine (C-C motif) ligand 2) locus, and were 
able to determine the distal enhancers and CTCF (a chromatin 
organizer) binding sites interacting with the CCL2 promoter (Fig. 1a, b). Our 
algorithm also identified a number of previously reported long-range 
chromatin interactions at the homeobox A (HoxA) gene cluster and 
the sonic hedgehog (SHH) locus, which were not readily observable 
from lower resolution analysis (Supplementary Figs 13 and 14). We 
then performed conventional 3C experiments to validate six pairs of 
long-range interactions identified at five different genes, and the 
results confirmed the reliability of our method (Fig. 1c and 
Supplementary Fig. 15). 

We next applied the above algorithm to the 518,032 anchor regions 
in the human genome, with each containing one or a few HindIII 
restriction fragments (fragments shorter than 2 kilobases (kb) are merged) 
(Fig. 2a), and uncovered a total of 1,116,312 chromatin interactions 
with a false discovery rate (FDR) of 0.1 (Supplementary Data). We 
found that strong interactions supported by lower P values and higher 
contact frequencies are more reproducible between biological 
replicates (Supplementary Fig. 16). As interactions between loci separated 
by more than 2 megabases (Mb) are very rare (Fig. 2c), we limit our 
search to this genomic span. The sizes of the identified interacting 
DNA loci range from several hundred base pairs to over 50 kb, with 
a median of 10.5 kb (Fig. 2b). We were able to identify chromatin 
interactions that span a genomic distance from several hundred base 
pairs to over 1 million base pairs (Fig. 2c). Consistent with previous 
reports that the genome is partitioned into megabase-sized topological 
domains, we found that a majority of the identified chromatin 
interactions in the IMR90 cells are located within the same topological 
domains (Fig. 2d and Supplementary Fig. 18). 
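Calling interactions at an FDR of 0.1 implies some multiple-testing correction over the per-pair P values. The text here does not say which procedure was used, so the sketch below shows the widely used Benjamini-Hochberg step-up procedure purely as an assumed example; the function name and the P values are hypothetical.

```python
def benjamini_hochberg(p_values, fdr=0.1):
    """Return a list of booleans marking which tests pass at the given FDR.

    Benjamini-Hochberg step-up: find the largest rank k such that the k-th
    smallest P value is <= fdr * k / m, and accept all tests up to that rank.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= fdr * rank / m:
            cutoff_rank = rank
    accepted = set(order[:cutoff_rank])
    return [i in accepted for i in range(m)]


# Hypothetical P values for four candidate interactions
calls = benjamini_hochberg([0.001, 0.04, 0.03, 0.5], fdr=0.1)
```

With four tests the thresholds are 0.025, 0.05, 0.075 and 0.1 for ranks 1 to 4, so the three smallest P values pass and the fourth does not.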

We next characterized the chromatin interactions centred on the 
cis-elements annotated in the IMR90 cell genome (Supplementary 
Data). Chromatin looping interactions are significantly enriched at 
cis-regulatory elements, particularly active promoters, enhancers and 
CTCF binding sites, and are rare at inactive transcription start sites 
(TSSs) or regions with repressive chromatin domains marked by 
H3K27me3 (Fig. 2e, f); notably, both active and poised enhancers 
(distinguished by the status of H3K27ac) are found equally likely to 
engage in looping interactions (Fig. 2e, f), raising the possibility that DNA 
looping could take place after priming of enhancers by H3K4me1 but 
before further activation. Interestingly, the chromatin interactions 
centred on CTCF binding sites tend to occur over a longer range than 
other types of cis-elements (Fig. 2g), confirming a recent result obtained 
from selected loci. We also explored the spatial organization of 
cis-elements by examining preferential interaction between different classes 
of elements. Strongest enrichment was observed between H3K27me3 
marked regions (Fig. 2h), consistent with the known compact 
three-dimensional structure at this type of repressive chromatin domains 
(for example, Supplementary Fig. 14a). The inactive promoters tend to 
interact with regions depleted of enhancers but enriched for repressive 
mark H3K27me3 (Fig. 2h), whereas CTCF binding sites loop to both 
active and inactive promoters with no preference, as reported previously. 
It is also interesting to observe that CTCF binding sites seem to prefer 
promoters over enhancers (Fig. 2h), suggesting a specific role for CTCF 
in organizing long-range chromatin interactions to promoters. 

Looping interactions between cis-regulatory elements and gene 
promoters have been shown to be important for transcription regulation 
at a number of loci. The genome-wide identification of chromatin 
interactions in the IMR90 cells allowed us to examine this concept 
systematically. We first focused on the looping interactions anchored 
to gene promoters, and denote the identified interacting sequences as 
promoter tethered regions (PTRs) (Fig. 3a). In IMR90 cells, we found 
57,585 PTRs identifying 29,132 enhancer-promoter pairs involving 
6,133 active promoters and 15,432 distal active enhancers 
(Supplementary Data). Only approximately 25% of enhancer-promoter pairs are 
within a 50-kb range, and approximately 57% span 100 kb or larger 
genomic distance, with a median distance of 124 kb (Fig. 3b). We 
assigned 55% of distal enhancers to at least one active promoter, and 
25% of enhancers to two or more active promoters (Fig. 3c, left panel). 
This result confirms the previous observation that promoters and 
enhancers often form complex networks to regulate transcription. We 
further reasoned that genes sharing common enhancers (denoted hub 
enhancers) are likely to have coordinated gene expression patterns. 
Indeed, genes sharing the same nuclear factor κB (NF-κB)-responsive 
enhancers are more frequently induced together by TNF-α than 
expected by chance (Fig. 3d). As an example, CKAP2L and IL1A are 
induced simultaneously by TNF-α although lacking promoter-bound 
p65 peaks, and they share overlapping distal PTRs containing multiple 
NF-κB binding sites (Fig. 3e). Similar examples can be found in other 
gene clusters co-induced by TNF-α treatment (Supplementary Fig. 19). 
These results therefore provide a molecular mechanism for 
coordinated gene expression of neighbouring genes. 

Interestingly, 46% of the active genes do not interact with any distal 
enhancer (Fig. 3c, right panel). Gene ontology analysis showed that 
these genes are enriched with housekeeping genes (Supplementary 
Fig. 20a). The remaining 54% of the active promoters demonstrate 
extensive looping interactions with enhancers (average 4.75 enhancers 
per gene, Fig. 3c, right panel), and they are enriched with genes related 
to biological pathways such as signal transduction (Supplementary 
Fig. 20b). This analysis suggests that housekeeping genes, despite being 
highly transcribed, engage few distal regulatory elements. 
Conversely, genes involved in cell-specific functions are under 
extensive control of distal regulatory sequences. 
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Figure 2 | Characterization of the IMR90 chromatin interactome. 

a, Histogram showing the size distribution of all HindIII restriction fragments 
in the human genome. The red curve plots the size distribution of all anchors used 
in this study. b, Size distribution of the genomic regions identified as interacting 
with all anchors. c, Distribution of genomic spans of all identified chromatin 
interactions. Histogram, frequency distribution; red curve, cumulative fraction 
with increasing distances. d, Fraction of chromatin looping interactions that fall 
within the same topological domains is plotted as a function of genomic 
distance between the two interacting loci (red curve). Black curve, expected 
fraction calculated from random shuffling locations of topological domains 
(100 iterations; dashed error lines, s.d.). e, Average number of peaks identified 
for anchors with different cis-elements. f, Percentages of all interactions 
involving various types of cis-elements (either anchor or target peak has the 
elements). In e and f, Z-scores were calculated comparing the actual values to 
simulation by randomly shuffling the locations of cis-elements (100 iterations, 
two-sided Z-test). *Z > 50, **Z > 100, ***Z > 150. g, Box plot showing the 
distance distribution from different types of anchors to their targets. Median 
distances are also labelled. t-statistics are computed comparing 
log-transformed distance between each type of anchors to all anchors as control 
(dashed horizontal line). *t > 20, **t > 40, ***t > 80 (two-sided t-test). 
h, Preferential interactions between different types of cis-elements. Heatmap 
shows the fold enrichment of different types of pair-wise combinations. P values 
are computed using the hypergeometric test and denoted in each cell. 
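The shuffling-based Z-scores mentioned in panels e and f follow a general pattern: compute a statistic on the real data, recompute it on randomly relocated elements, and express the real value in units of the simulated standard deviation. The sketch below shows that pattern on a toy genome of numbered bins; the bin layout, the overlap statistic and all names are hypothetical and much simpler than the real analysis.

```python
import random
from statistics import mean, stdev


def permutation_z_score(observed, simulate, iterations=100, seed=0):
    """Z-score of an observed statistic against shuffled simulations.

    `simulate` is a function taking a random.Random instance and returning
    the statistic for one random shuffle.
    """
    rng = random.Random(seed)
    simulated = [simulate(rng) for _ in range(iterations)]
    return (observed - mean(simulated)) / stdev(simulated)


# Toy genome: 1,000 bins, 100 of them anchors with called interactions
N_BINS = 1000
anchor_bins = set(range(100))
element_bins = set(range(50))  # hypothetical cis-elements, all inside anchors


def count_overlap(elements):
    """Number of elements that fall in anchor bins."""
    return len(anchor_bins & set(elements))


def shuffled_overlap(rng):
    """Overlap after placing the same number of elements at random bins."""
    return count_overlap(rng.sample(range(N_BINS), len(element_bins)))


z = permutation_z_score(count_overlap(element_bins), shuffled_overlap)
```

Fixing the seed makes the simulated distribution reproducible, which helps when the same Z-score has to be regenerated for a figure.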


We next examined long-range looping interactions at transcriptional 
enhancers, focusing on those bound by the p65 subunit of the NF-κB 
transcription factor. Using chromatin immunoprecipitation followed by 
high-throughput sequencing (ChIP-seq), we identified 15,621 p65 binding 
sites in the genome after TNF-α treatment, 2,315 (14.8%) of which can 
be classified as 'active p65 binding sites' because they exhibit increased 
H3K27ac levels and enhancer RNA expression upon TNF-α signalling 
(Supplementary Figs 21 and 22). Consistent with their putative role in 
mediating transcriptional induction, these active p65 binding sites are 
enriched near TNF-α-dependent genes (Supplementary Fig. 22c). We 
next tested whether the long-range interactions between these p65 
binding sites and their target promoters are correlated with 
transcriptional induction. Indeed, at the promoters that exhibit interactions 
with one or more active p65 binding sites, significantly higher levels 
of transcriptional induction were observed than at the promoters that do 
not interact with distal p65 binding sites (Fig. 3f), suggesting that the 
identified long-range chromatin interactions may have a key role in 
transcriptional regulation of the TNF-α-inducible genes. 

Figure 3 | Identification and characterization of promoter-enhancer 
interactions in IMR90 cells. a, A schematic of promoter tethered regions 
(PTRs). b, Distribution of distances between the promoters and enhancers 
found within the PTRs. c, Bar charts show the degrees of interactions. Left, 
percentage of enhancers that are looped to active promoters found in PTRs. The 
first value of µ is the mean degree of promoter interactions for all enhancers; the 
second value is the mean degree of promoter interactions for enhancers 
interacting with at least one promoter. Right, distribution of degrees of 
enhancer interactions for active promoters. d, Genes sharing enhancers are 
co-regulated. Among the target genes of TNF-α-responsive hub enhancers, 
proportion of co-induced gene pairs (greater than twofold) are plotted and 
compared to random simulation (100 iterations, two-sided Z-test, *P < 0.05, 
**P < 0.01, error bar, s.d.). e, Genome browser snapshots showing the virtual 
3C plots of the CKAP2L and IL1A promoters. f, Box plots comparing the 
induction of different gene groups based on the location of p65-induced 
enhancers. 'No enhancers', genes with no such enhancers within 2 Mb; 
'≥1 not in PTR', p65-induced enhancers are within 2 Mb but not in PTR. 

The high-resolution map of chromatin interactions may also improve 
the prediction of target genes of distal enhancers. Currently, a common 
practice is to assign distal enhancers to their nearest promoters, 
assuming one enhancer is linked to just one target gene (proximity approach). 
However, this approach cannot explain all of the 828 TNF-α-responsive 
genes. We found that 331 (40%) of these genes have one or more p65 
binding sites within 2.5 kb of their promoters, and 362 genes of the 
remainder can be assigned to one or more NF-κB binding sites by the 
proximity approach, leaving 135 TNF-α-induced genes unexplained. 
Using a recently published enhancer-promoter connection map based 
on correlated chromatin features across diverse tissues or cell types, we 
were able to link 10 of the 135 unexplained TNF-α-inducible genes to 
distal NF-κB binding sites. Using the chromatin interactome map, 74 
(55%) of the unexplained genes can be assigned to an NF-κB binding site 
(Supplementary Fig. 23). This result illustrates that the chromatin 
interactome map described here could be valuable for the study of 
long-range regulation of gene expression by transcription factors. 
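The difference between the proximity approach and a loop-based assignment can be illustrated with a small sketch. It assumes promoters on one chromosome are given as sorted (position, gene) pairs and that looping interactions are available as a mapping from enhancer position to gene names; the coordinates, gene names and function names are all hypothetical.

```python
from bisect import bisect_left


def nearest_promoter(enhancer_pos, promoters):
    """Proximity approach: return the gene whose promoter is closest.

    `promoters` is a list of (position, gene) tuples sorted by position.
    """
    positions = [pos for pos, _ in promoters]
    i = bisect_left(positions, enhancer_pos)
    # Only the promoters flanking the insertion point can be the nearest.
    candidates = promoters[max(0, i - 1): i + 1]
    return min(candidates, key=lambda pg: abs(pg[0] - enhancer_pos))[1]


def looped_targets(enhancer_pos, loops):
    """Loop-based approach: return every gene in contact with the enhancer."""
    return sorted(loops.get(enhancer_pos, set()))


# Hypothetical promoters and one hypothetical enhancer at position 40,000
promoters = [(1_000, "geneA"), (50_000, "geneB"), (200_000, "geneC")]
loops = {40_000: {"geneA", "geneC"}}

by_proximity = nearest_promoter(40_000, promoters)
by_looping = looped_targets(40_000, loops)
```

In this toy case the proximity approach assigns the enhancer to geneB alone, whereas the loop map links it to geneA and geneC, showing how a contact map can recover targets that are neither nearest nor unique.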

We found no obvious alterations of megabase topological domains 
in IMR90 cells after TNF-α treatment (for example, Supplementary 
Fig. 13). As previous studies have shown that gene activation by 
enhancers is accompanied by alteration of chromatin interactions, we 
expected that at shorter distance, binding of NF-κB to enhancers 
would induce looping interactions that bring the distal enhancers into 
proximity with target genes. To our surprise, we found that at the vast 
majority of TNF-α-responsive enhancers, there is little change of DNA 
looping after treatment (Fig. 4a). These results suggest that in general, 
enhancer-promoter interactions already form in untreated cells, and 
these pre-existing DNA structures are not significantly altered by 
transient activation or repression of enhancers. 3C assays confirm that 
DNA looping exists at several loci before TNF-α treatment (Fig. 1c and 
Supplementary Fig. 15a-c). We further compared the normalized 
read counts anchored to the TNF-α-activated enhancers at different 
ranges of genomic distances (Fig. 4b and Supplementary Methods). 
Consistent with the results in Fig. 4a, transient activation of p65-bound 
enhancers does not lead to significant changes in chromatin 
interactions (Fig. 4b). By contrast, the chromatin interactions (especially 
within a short genomic distance) at cell-type-specific enhancers are 
highly variable between cell types (Fig. 4c, d), suggesting the existence 
of specific chromatin interaction structures at cell-type-specific 
enhancers. Interestingly, the discrepancy between signal-dependent and 
cell-type-specific enhancers is well correlated with the levels of H3K4me1 
at those dynamically regulated enhancers: despite the quick induction 
of the H3K27ac mark at TNF-α-responsive enhancers, strength of 
H3K4me1 signal is largely unchanged (Fig. 4e), but on the other hand, 
cell-type-specific enhancers have highly cell-specific H3K4me1 
occupancy (Fig. 4f). 

Recently, pre-existing promoter-enhancer looping was observed at 
several loci induced by p53, FOXO3 and glucocorticoid receptor using 
the 4C (chromosome conformation capture combined with sequencing) 
approach. Our genome-wide analysis of chromatin interactome 
maps in IMR90 cells suggests that this is likely to be a common rule 
rather than a special case. To further demonstrate its generality, we 
examined six additional promoter-enhancer pairs by 3C assays in four 
different cell types (IMR90, HUVEC, MCF7 and LNCaP cells) under 
different stimuli (IFN-γ, TNF-α, β-oestradiol and 5α-dihydrotestosterone, 
respectively). In all of these examples, we found evident pre-existing 
promoter-enhancer contacts, and the looping interactions are largely 
unchanged after enhancer activation and target gene induction 
(Supplementary Figs 15d and 24). Our results predict that pre-existing 
chromatin looping interactions could dictate the spectrum of the target 
genes for a transcription factor even before it is activated. Indeed, p65 
binding sites looping to the promoters before induction are much more 
likely to result in transcriptional activation of the linked gene than 
otherwise (Fig. 4g). This trend is particularly obvious when the 
p65-binding sites are located far from the linked promoters. On the basis of 
this observation, we conclude that pre-existing DNA-looping 
interactions between enhancers and promoters allow a ubiquitous, 
signal-dependent transcription factor to affect a selected set of genes in a 
cell-type-specific manner. 

In summary, we have demonstrated that enhancer—promoter inter- 
actions already form in each cell type before the binding of signal depen- 
dent transcription factors, and they undergo relatively few changes 
during transient transcriptional activation. Several recent genome- 
wide studies have revealed that in different cell types, the repertoire 
of specific enhancers provides a unique context for the activation of 
different transcriptional programs in response to signal-dependent 
transcription factors including NF-κB. Here, our results further
suggest that targets of cell-specific enhancers are already hardwired 
into the chromatin architecture in each cell lineage. We therefore 
propose that cell-type-specific looping structure, by controlling the 
accessibility of the enhancers to their specific targets, may form an 
additional layer of regulation in determining the distinct transcription 
programs in different cell types. 


Figure 4 | The higher order chromatin structure 
METHODS SUMMARY


Hi-C experiments were performed in human primary IMR90 fibroblasts.
ChIP-seq, global run-on sequencing (GRO-seq) or RNA-seq libraries were also
generated from IMR90, HUVEC, MCF7 or LNCaP cells and sequenced on the
Illumina HiSeq 2000 platform. All reads were mapped to the reference human
genome (hg18). More information about the experiments and detailed
descriptions of the Hi-C data analysis pipeline, including data filtering,
normalization, statistical modelling and interaction calling, can be found in
Supplementary Methods.
The design of G-protein-coupled receptor (GPCR) allosteric modulators, an
active area of modern pharmaceutical research, has proved challenging
because neither the binding modes nor the molecular mechanisms of such
drugs are known. Here we determine binding sites, bound conformations and
specific drug-receptor interactions for several allosteric modulators of the
M2 muscarinic acetylcholine receptor (M2 receptor), a prototypical family A
GPCR, using atomic-level simulations in which the modulators spontaneously
associate with the receptor. Despite substantial structural diversity, all
modulators form cation-π interactions with clusters of aromatic residues in
the receptor extracellular vestibule, approximately 15 Å from the classical,
‘orthosteric’ ligand-binding site. We validate the observed modulator
binding modes through radioligand binding experiments on receptor mutants
designed, on the basis of our simulations, either to increase or to decrease
modulator affinity. Simulations also revealed mechanisms that contribute to
positive and negative allosteric modulation of classical ligand binding,
including coupled conformational changes of the two binding sites and
electrostatic interactions between ligands in these sites. These observations
enabled the design of chemical modifications that substantially alter a
modulator’s allosteric effects. Our findings thus provide a structural basis
for the rational design of allosteric modulators targeting muscarinic and
possibly other GPCRs.

A third of marketed drugs act on GPCRs, generally by binding at the
orthosteric site in competition with the endogenous ligands that naturally
regulate receptor signalling. There is considerable interest, however, in
drugs that bind at distinct, allosteric sites. Such allosteric modulators may
achieve selectivity between receptor subtypes that possess nearly identical
orthosteric sites. In addition, because they act primarily by altering the
affinity or efficacy of endogenous ligands, allosteric modulators may provide
a means to fine-tune cellular responses to the body’s natural signalling
patterns.

Despite recent advances in GPCR crystallography, no structure has been
determined of a GPCR bound to a drug-like allosteric modulator.
Moreover, the molecular mechanisms by which such modulators
affect GPCR signalling may depend on dynamical properties that 
would not be evident from a single static structure. 

To address these structural and mechanistic questions, we performed
atomic-level molecular dynamics simulations of several allosteric modulators
binding spontaneously to the M2 receptor, a long-serving model system for
allosteric modulation of GPCRs. Such simulations have previously proved
capable of capturing the complete process of drug binding, with a ligand
exploring a receptor’s surface and ‘discovering’ the binding conformation at
crystallographic accuracy, without previous knowledge of the binding site.
Our simulations, which included lipids, water and ions surrounding the
receptor, allowed conformational changes in both the receptor and the
modulators, and made no assumptions about where the modulators might bind.

Figure 1a depicts the simulated binding process of a prototypical
allosteric modulator, C7/3-phth, to an unliganded human M2 receptor
(cf. Supplementary Fig. 1 and Supplementary Tables 1 and 2). C7/3-phth
settled into a stable and reproducible bound pose in the extracellular
vestibule, a region approximately 15 Å from the orthosteric site (Fig. 1b,
Supplementary Video 1, Supplementary Fig. 2 and Supplementary Table 3).
The C7/3-phth core (the two ammonium groups and the heptyl linker)
remained tightly bound to the receptor, whereas the distal phthalimide
groups were highly mobile (Supplementary Fig. 3). The close structural
analogue dimethyl-W84 bound in a very similar fashion (Fig. 1c). The binding
modes observed for C7/3-phth and dimethyl-W84 are consistent with extensive
mutagenesis data: all mutated residues that reduce the binding affinity of
C7/3-phth, dimethyl-W84 or related bis-ammonium alkane ligands by more than
fivefold formed stable contacts with the relatively immobile C7/3-phth and
dimethyl-W84 cores (Fig. 1b and Supplementary Fig. 4).

We extended our binding simulations to include structurally diverse 
and clinically relevant drugs, including the neuromuscular-blocking 
drugs gallamine and alcuronium and the convulsant strychnine. These 
modulators are known to compete with one another, C7/3-phth and
dimethyl-W84 for binding to the M2 receptor, and are thus believed to
bind to the same region of the receptor.

In our simulations, all these allosteric modulators not only bound to
the same site (the extracellular vestibule) but also made similar binding
interactions, despite their structural diversity (Fig. 1c and Supplementary
Fig. 5). Ammonium groups from each ligand occupied one or both of two
common ‘binding centres’ in the extracellular vestibule. These centres are
each defined by a pair of aromatic residues, an ‘aromatic box’, that form
cation-π interactions with the ammonium group (Fig. 1d, e): Tyr 177^ECL2 and
Trp 422^7.35 at centre 1, and Tyr 80^2.61 and Tyr 83^2.64 at centre 2.
(Superscripts, shown after a caret, refer to Ballesteros-Weinstein residue
numbering, with ECL2 denoting the second extracellular loop.)

The specific poses and receptor-drug interactions we observed,
although completely in accord with extensive previous mutagenesis
data (Supplementary Fig. 4) and the widely held belief that these
allosteric modulators bind in the vicinity of the extracellular vestibule,
generally differ from earlier predictions. Previous studies have suggested,
for example, that the abundant aromatic residues in the vestibule interact
with the aromatic groups common to many allosteric modulators. We found
that these residues instead formed cation-π interactions with modulator
ammonium groups; modulator aromatic groups, when present, formed
non-specific hydrophobic contacts with the receptor. Indeed, hexamethonium,
devoid of aromatic groups, bound in a pose analogous to that of the
C7/3-phth core in simulation
Figure 1 | Structurally diverse allosteric modulators bind spontaneously to
the human M2 muscarinic acetylcholine receptor in simulation, revealing a
common binding mode. a, C7/3-phth diffuses freely before binding stably in
the M2 receptor extracellular vestibule. The C7/3-phth position at various times
is represented by a stick connecting its two ammonium groups. (Top:
C7/3-phth structure.) b, Typical bound pose of C7/3-phth (purple). Residues
known to reduce binding more than fivefold upon mutation (cyan) all contact
C7/3-phth more than 90% of the time after it binds. c, Typical bound poses for
dimethyl-W84, gallamine, strychnine and alcuronium. Ammonium groups are


(Supplementary Fig. 6). We also found that modulator cationic
ammonium groups did not form stable contacts with anionic receptor
residues such as those in the widely studied ECL2 ‘EDGE’ motif
(Supplementary Fig. 7), in contrast to such direct recognition of the
ammonium group of orthosteric ligands by the conserved anionic residue
Asp 103^3.32 (ref. 5).

To validate these computationally determined binding modes, we
experimentally measured the binding of C7/3-phth and gallamine to
several new M2 receptor mutants designed, on the basis of simulations,
either to increase or to decrease modulator affinity (Fig. 2 and
Supplementary Fig. 8). To increase affinity, we introduced aromatic residues
near centres 1 and 2, to strengthen the cation-π interactions (Fig. 2a),
or anionic residues near centre 2, to form favourable ionic interactions
(Fig. 2b). To decrease affinity, we introduced cationic residues near
either centre (Fig. 2c). In all cases, affinity increased or decreased as
predicted (Fig. 2d and Supplementary Figs 9-12). Additional mutagenesis
results suggest that centre 1 contributes more strongly than centre
2 to C7/3-phth and gallamine binding, in accord with computational
findings (Supplementary Figs 13 and 14).

Next, we examined the mechanism of allosteric modulation, in 
particular, the ability of allosteric modulators to increase or decrease 
the affinity of orthosteric ligands. Because the M2 receptor crystal 
structure represents an inactive, antagonist-bound state, we focused 
on modulation of antagonist binding. 

C7/3-phth, dimethyl-W84 and gallamine are negative allosteric modulators
(NAMs) of the archetypal orthosteric antagonist N-methylscopolamine
(NMS): their binding weakens NMS affinity, and vice versa. Alcuronium 
and strychnine, by contrast, are positive allosteric modulators (PAMs) 
of NMS: their binding increases NMS affinity. Simulations of modula- 
tors binding to unliganded and NMS-bound receptors were consistent 
highlighted (blue disks). d, Schematic representation of the ammonium
binding centres. e, Bound locations of modulator ammonium groups (spheres).
C7/3-phth, dimethyl-W84 and gallamine each occupy centres 1 and 2 (grey
ellipsoids); strychnine occupies only centre 1, and alcuronium only centre 2. 
The position of each centre varies slightly depending on the bound modulator 
because the surrounding residues reposition to accommodate the modulator. 
The third gallamine ammonium and the second of alcuronium lie outside the 
two centres (alcuronium cannot occupy both simultaneously owing to its 
geometry). 


with these experimental observations (Fig. 3a, Supplementary Figs 2, 
15 and 16 and Supplementary Table 3). Although C7/3-phth bound
consistently in a stable, reproducible pose in simulations that did not 
include an orthosteric ligand, for example, it bound only transiently to 
an NMS-occupied receptor (Fig. 3a). 
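The reciprocal weakening or strengthening of binding described above is often expressed with the allosteric ternary complex model. The text does not state which formalism was used, so the following is only a sketch under that assumption; the function name and the example values are hypothetical. Here a cooperativity factor alpha below 1 corresponds to a NAM and above 1 to a PAM.

```python
def apparent_kd(kd_orthosteric, modulator_conc, kd_modulator, alpha):
    """Apparent dissociation constant of an orthosteric ligand when an
    allosteric modulator is present (allosteric ternary complex model).

    Kd_app = Kd * (1 + [B]/K_B) / (1 + alpha * [B]/K_B)

    alpha < 1: negative cooperativity (Kd_app rises, affinity falls)
    alpha > 1: positive cooperativity (Kd_app falls, affinity rises)
    All concentrations must share the same unit.
    """
    ratio = modulator_conc / kd_modulator
    return kd_orthosteric * (1.0 + ratio) / (1.0 + alpha * ratio)


# Hypothetical numbers: orthosteric Kd of 0.2 nM, modulator at 10x its own Kd.
nam = apparent_kd(0.2, 10.0, 1.0, alpha=0.1)   # weaker orthosteric binding
pam = apparent_kd(0.2, 10.0, 1.0, alpha=10.0)  # tighter orthosteric binding
print(f"NAM: {nam:.3f} nM, PAM: {pam:.4f} nM")
```

At saturating modulator concentration the apparent constant tends to Kd/alpha, which is why a single factor captures the reciprocal effect on both ligands.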

Our simulations revealed two mechanisms that seem to contribute 
to the cooperativity of orthosteric and allosteric ligand binding. First, 
electrostatic repulsion between a cationic orthosteric ligand (for example, 
NMS) and a cationic allosteric modulator weakens the binding of one 
in the presence of the other. The degree of repulsion depends on the 
total charges of both the modulator and the orthosteric ligand, and on 
the locations of those charges when the ligands are bound (Fig. 3b). 
Less cationic NAMs, or those whose positive charges are further from 
NMS, experience less repulsion. Electrostatic repulsion seems to account 
for most of the experimentally observed negative cooperativity of the 
NAMs we studied (Supplementary Table 4). Indeed, replacement of a 
cationic nitrogen atom with a neutral silicon atom was found to shift 
the cooperativity of C7/3-phth in the positive direction.
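The dependence of repulsion on total charge and on charge location can be illustrated with a point-charge Coulomb estimate. This is a rough sketch only: the uniform dielectric constant and the distances below are illustrative assumptions, not values from the simulations, and the function names are hypothetical.

```python
COULOMB_KCAL = 332.06  # Coulomb constant in kcal * Angstrom / (mol * e^2)


def coulomb_energy(q1, q2, distance_angstrom, dielectric):
    """Interaction energy (kcal/mol) of two point charges in a uniform dielectric."""
    return COULOMB_KCAL * q1 * q2 / (dielectric * distance_angstrom)


def total_repulsion(orthosteric_charge, modulator_distances, dielectric=10.0):
    """Sum pairwise energies between one orthosteric charge and several
    +1 ammonium groups placed at the given distances (Angstrom).
    The default dielectric of 10 is an arbitrary illustrative choice."""
    return sum(
        coulomb_energy(orthosteric_charge, 1.0, d, dielectric)
        for d in modulator_distances
    )


# Hypothetical geometries: two ammonium groups versus three, one of them closer.
two_charges = total_repulsion(1.0, [15.0, 18.0])
three_charges = total_repulsion(1.0, [15.0, 18.0, 11.0])
print(f"{two_charges:.2f} vs {three_charges:.2f} kcal/mol")
```

In this picture, adding a charge or moving one closer to the orthosteric ligand raises the repulsion, and removing a charge lowers it.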

Electrostatic repulsion, however, cannot explain the positive
cooperativity of cationic PAMs. Indeed, on the basis of electrostatics alone,
the NAM C7/3-phth and the PAM alcuronium would be predicted to
have identical, not opposite, cooperativities.

Our simulations suggest a second operative mechanism underlying 
cooperativity: the conformations of the orthosteric and allosteric sites 
are coupled such that the presence of a ligand in one affects the shape of
the other (Fig. 3c and Supplementary Fig. 17). In simulations with 
NMS bound in the orthosteric site, both the allosteric and orthosteric 
sites maintained the wide-open conformations observed crystallogra- 
phically in the presence of 3-quinuclidinyl benzilate (QNB), another 
orthosteric antagonist. By contrast, in simulations with no ligand present, 
Figure 2 | Experimental validation of computationally derived binding
modes. a-c, Mutagenesis predictions: observed binding poses, and simulations
of mutants, indicate that certain mutations should increase or decrease
C7/3-phth and gallamine binding affinity. a, Aromatic residues (N410Y, N419W)
contribute extra cation-π interactions at centre 1, increasing affinity. b, Anionic
residues (T423E, T84D) attract an ammonium group at centre 2, increasing
affinity. c, Cationic residues (N410K, T423K, Y80K, Y83K) near centres 1 and 2


both the orthosteric and allosteric sites assumed a narrower conforma- 
tion, with ECL2 shifting approximately 2.5 Å towards ECL3 as helix 5
shifted slightly into the region vacated by the orthosteric ligand (Fig. 3c). 

Interestingly, binding of the wide, bulky PAM alcuronium tended to 
force the allosteric and orthosteric sites into their open conformations, 
even when the orthosteric site was empty (Supplementary Figs 17 and 
18). This conformational change seems to contribute to the positive 
cooperativity between NMS and alcuronium: binding of either one 
shifts the receptor into the conformation with higher affinity for the 
other. By contrast, the narrow NAM C7/3-phth can bind to either the
wide or narrow vestibule conformation; in fact, it forms more extensive 
contacts when bound to the narrow conformation, which is favoured 
when the orthosteric site is empty. Strychnine’s weak positive coop- 
erativity seems to result from a mild preference for a more open ves- 
tibule conformation coupled with the weakest electrostatic repulsion of 
the modulators studied. Although our data do not rule out other pos- 
sible mechanisms, the combination of these two effects—sterics and 
electrostatics—seems sufficient to explain positive and negative modu- 
lation by these allosteric modulators. 

On the basis of this mechanistic understanding, we predicted that 
one could shift the cooperativity of a modulator in the positive direc- 
tion by modifying the modulator such that it could bind only to a wider 
conformation of the extracellular vestibule. To test this prediction, we 
designed and synthesized a bulky C7/3-phth variant, 4P-C7/3-phth,
that bound in the same pose as C7/3-phth in test simulations but
required a wider vestibule to do so (Fig. 3d and Supplementary Fig. 19).
Indeed, radioligand binding experiments showed that, in comparison
with C7/3-phth, 4P-C7/3-phth has significantly reduced negative
cooperativity (that is, approximately fourfold more positive
cooperativity) with NMS (Fig. 3d). 4P-C7/3-phth also has approximately
50-fold higher binding affinity than C7/3-phth, suggesting that this
modification did not disrupt the binding mode (Supplementary Fig. 20).


repel or compete with cationic modulators, decreasing affinity. d, Mutagenesis
results: radioligand binding experiments ([3H]NMS displacement), performed
on whole cells expressing wild-type and mutant human M2 receptors,
confirmed computational predictions. Data represent mean ± s.e.m. of three to
five independent experiments (see Supplementary Table 6). Asterisks indicate a
significant difference from the wild type (P < 0.05).


Allosteric modulation of agonist binding may rely on similar mecha- 
nisms. As muscarinic agonists, including the endogenous agonist acetyl- 
choline, are generally cations, electrostatic interactions between modulators 
and agonists should mimic those seen with antagonists. The smaller 
size typical of agonists, and the orthosteric site contraction seen on
activation of other family A GPCRs, suggest that PAMs for agonists
may tend to be smaller than PAMs for antagonists. 

Effective structure-based design of allosteric modulators must contend
with the flexibility of the extracellular vestibule, which exhibits
various conformations preferred by different modulators. In particular,
computational screening approaches based on a single rigid receptor
structure may identify only a subset of potent allosteric modulators.
Indeed, our attempts to predict modulator binding poses using stand- 
ard docking protocols with the M2 receptor crystal structure yielded 
results substantially different from those observed in molecular dyna- 
mics simulations and in worse agreement with experimental data 
(Supplementary Fig. 21). 

Our results provide a structural basis for the subtype selectivity of
muscarinic allosteric modulators. The typically lower affinity of
modulators for the M3 receptor, for example, is probably caused by the
unique presence of a cationic residue, Lys 523^7.32, within 8 Å of centres
1 and 2 (Supplementary Fig. 22). Several other poorly conserved resi- 
dues that form contacts with the bound allosteric modulators might 
also be exploited to achieve a desired selectivity profile (Supplementary 
Table 5). 

All the allosteric modulators we studied—PAMs and NAMs— 
typically retard both binding and dissociation of orthosteric ligands.
Confirming a previous hypothesis, our simulations indicate that
orthosteric ligands cannot easily gain access to, or egress from, their 
binding site when a modulator is bound in the extracellular vestibule, 
because the vestibule also typically serves as a way station along the 
binding pathway of orthosteric ligands. This explains how NAMs 


Figure 3 | Mechanisms of positive and negative allosteric modulation.
a, Allosteric modulator binding affinity depends on orthosteric site occupancy.
Each bar represents a simulation of C7/3-phth binding to unliganded (left) or
NMS-bound (right) M2 receptor. C7/3-phth bound stably in the canonical pose
to the unliganded receptor, but frequently shifted pose or dissociated entirely
when NMS was bound. b, Electrostatic interactions between ligands contribute
to cooperativity. Blue shading denotes the change in electrostatic potential due
to addition of NMS. Modulator ammonium groups are circled (yellow).
Electrostatic repulsion with NMS is similar for C7/3-phth and alcuronium, but
substantially larger for gallamine, which positions its third ammonium group
particularly close to the orthosteric site (see Supplementary Table 4).
c, Conformational coupling between sites contributes to cooperativity. Left:
with no allosteric modulator bound, the extracellular vestibule of NMS-bound
receptor is wider than that of unliganded receptor (molecular snapshots and
histograms from 10-μs simulations). Right: cross-sectional views from
simulations without (left) or with (right) NMS in the orthosteric site, and with
no modulator (top), alcuronium (middle) or C7/3-phth (bottom) in the
allosteric site. Alcuronium binding requires a wide vestibule, whereas
C7/3-phth can bind to a wide or narrow vestibule. d, We designed 4P-C7/3-phth
(left) computationally to adopt the C7/3-phth binding pose but require a wider
vestibule (right), rendering its cooperativity more positive (less negative).
[3H]NMS binding experiments (bottom) indicate that the cooperativity of
4P-C7/3-phth is approximately fourfold less negative than that of C7/3-phth.
Data represent mean ± s.e.m. of four independent experiments
(see Supplementary Table 7). Asterisks indicate a significant difference from
C7/3-phth (P < 0.05).


may slow dissociation of an orthosteric ligand despite weakening its 
equilibrium affinity. 

Although numerous allosteric modulators seem to bind in or near 
the extracellular vestibules of muscarinic receptors and other GPCRs,
certain modulators seem to bind elsewhere. Further computational
and experimental investigation is necessary to determine the extent to 
which the molecular basis of allosteric modulation is conserved across 
GPCRs. The techniques we have demonstrated here might also be used 
to identify allosteric mechanisms operative in other protein families. 


METHODS SUMMARY 


Molecular dynamics simulations of the human M2 receptor, embedded in a 
hydrated palmitoyloleoylphosphatidylcholine bilayer with all atoms represented 
explicitly, were performed on Anton using the CHARMM force field. Simulations
began with the receptor in the crystallographic conformation (Protein Data Bank 
accession number 3UON; ref. 5), but in most cases the co-crystallized antagonist 
(QNB) was either removed or replaced by NMS (Supplementary Fig. 23). In 
binding simulations, a single ligand was placed in the aqueous region away from 
the receptor. No artificial forces were applied to the ligand, which generally dif- 
fused extensively about the simulated volume before binding. 

Mutant receptors were constructed using standard cloning techniques and 
expressed as Chinese hamster ovary (CHO) stable cell lines. Saturation binding 
experiments were performed on whole cells using [3H]NMS (0.003-3 nM), in the
absence or presence of atropine (100 μM) at 22 °C. Inhibition binding and
dissociation kinetics experiments, also on whole cells, were performed using
0.2 nM [3H]NMS (a concentration that approximates the dissociation constant
(Kd)) in the presence of various concentrations of C7/3-phth, gallamine,
alcuronium or 4P-C7/3-phth at 22 °C.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


System setup for molecular dynamics simulations. Simulations of the human 
M2 muscarinic acetylcholine receptor (M2 receptor) were based on the M2-QNB 
complex crystal structure (Protein Data Bank accession number 3UON; ref. 5). 
Coordinates were prepared by removing the T4 lysozyme insert and capping 
residues 217 and 377 with N-methyl amide and acetyl groups, respectively (that 
is, intracellular loop 3 was absent from the simulations). Side chains for
Lys 383^6.31 and Lys 384^6.32, not resolved in the crystal structure, were
modelled by hand. Hydrogen atoms were added using Maestro (Schrödinger), as
previously described. Histidine residues were simulated as the neutral Nε
tautomer. Other titratable residues were simulated in their dominant
protonation state at pH 7 except for Asp 69^2.50, which was protonated;
Asp^2.50 is protonated during the entire photocycle in rhodopsin.
We performed a control simulation of C7/3-phth binding to an unliganded M2
receptor in which Asp 69^2.50 was deprotonated (Supplementary Table 2,
condition AO); a sodium ion bound to D69^2.50, and afterwards, C7/3-phth
bound in a pose that closely matched the pose seen in our other simulations,
in which D69^2.50 was protonated. Ligand protonation states are indicated in
Supplementary Fig. 1. In particular, strychnine was simulated in the
protonated form (pKa 8.26; ref. 33).

The M2 receptor was inserted into an equilibrated
palmitoyloleoylphosphatidylcholine bilayer as described previously. Sodium
and chloride ions were added to neutralize the system, and extra NaCl was
added to reach a final concentration of approximately 150 mM. In each
simulation, the simulation box initially measured 79 Å × 79 Å × 85 Å, and
contained one M2 receptor, approximately 156
palmitoyloleoylphosphatidylcholine molecules, approximately 24 sodium
ions, approximately 35 chloride ions and approximately 9,165 water molecules,
for a total of approximately 53,000 atoms. A control simulation of C7/3-phth
binding to an unliganded M2 receptor was designed to match radioligand binding
experimental conditions closely; it contained approximately 10 mM MgCl2 in
addition to 100 mM NaCl (Supplementary Table 2, condition AP). This
simulation (run at 22 °C rather than the usual 37 °C; see below) showed no
noticeable differences in behaviour from our other simulations of C7/3-phth
binding; in particular, C7/3-phth adopted the same bound pose.
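As a rough consistency check on the composition quoted above, the number of NaCl ion pairs implied by a target concentration can be estimated from the water count alone. This sketch assumes that bulk water is about 55.5 mol per litre and that ions are distributed only in the water phase; the function name is hypothetical and this is not the setup procedure used by the authors.

```python
WATER_MOLARITY = 55.5  # mol/L, approximate value for liquid water (assumption)


def salt_pairs_for_concentration(n_water, molar):
    """Estimate the number of salt ion pairs that gives the requested molar
    concentration in a box containing n_water water molecules."""
    return molar * n_water / WATER_MOLARITY


# Values quoted in the text: about 9,165 waters and about 150 mM NaCl.
pairs = salt_pairs_for_concentration(9165, 0.150)
print(f"estimated NaCl pairs: {pairs:.1f}")
```

The estimate of roughly 25 pairs is in line with the approximately 24 sodium ions reported; the larger chloride count is consistent with extra counterions added to neutralize the system.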

Depending on the simulation, the orthosteric site was either unoccupied or 
occupied by QNB or NMS. For simulations with QNB in the orthosteric site, 
the coordinates from the M2-QNB complex were used (ref. 5). For simulations
with an unoccupied orthosteric site, the co-crystallized QNB was deleted from
the protein. For simulations with NMS in the orthosteric site, the M3
muscarinic receptor-tiotropium complex crystal structure (Protein Data Bank
accession number 4DAJ; ref. 35) was used to model the bound pose of NMS. The
receptors were aligned using the Cα atoms of residues within 5 Å of
tiotropium, and tiotropium was morphed into NMS; the NMS phenyl group was
positioned to overlap the more intracellular of the two tiotropium thienyl
groups (Supplementary Fig. 23).

Simulation protocol. All simulations were performed using Anton, a
special-purpose computer designed to accelerate standard molecular dynamics
simulations by orders of magnitude. Simulations were equilibrated in the NPT
ensemble (37 °C (310 K), 1 bar (~1 atm)), with 5 kcal mol⁻¹ Å⁻² harmonic
position restraints applied to all non-hydrogen protein and ligand atoms;
restraints were tapered off linearly over 50 ns. Bond lengths to hydrogen
atoms were constrained using M-SHAKE, and an r-RESPA integrator was used
(2-fs time step; long-range electrostatics computed every 6 fs). Production
simulations used the same integrator, pressure and temperature, and were
initiated from the final snapshot of the corresponding equilibration
simulation, with velocities sampled from the Boltzmann distribution at 310 K.
Van der Waals and short-range electrostatic interactions were cut off at
13.5 Å, and long-range electrostatic interactions were computed using the
k-space Gaussian split Ewald method with a 32 × 32 × 32 grid, σ = 3.33 Å and
σs = 2.33 Å.
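The restraint taper and the multiple-time-step schedule described above can be written out as a small sketch. It assumes the taper starts at the beginning of equilibration (the text does not say whether a hold period precedes it) and uses hypothetical function names; it is not Anton's implementation.

```python
def restraint_force_constant(t_ns, k0=5.0, taper_ns=50.0):
    """Force constant (kcal/mol/A^2) of the position restraints at time t_ns,
    decreasing linearly from k0 to zero over taper_ns nanoseconds."""
    if t_ns <= 0.0:
        return k0
    if t_ns >= taper_ns:
        return 0.0
    return k0 * (1.0 - t_ns / taper_ns)


def is_long_range_step(step, dt_fs=2, long_range_interval_fs=6):
    """True when long-range electrostatics are recomputed on this step,
    i.e. every third step for a 2-fs time step and a 6-fs interval."""
    steps_per_update = long_range_interval_fs // dt_fs
    return step % steps_per_update == 0


for t in (0, 25, 50):
    print(t, "ns ->", restraint_force_constant(t), "kcal/mol/A^2")
print([s for s in range(10) if is_long_range_step(s)])
```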

Spontaneous binding of allosteric modulators. In simulations of known
allosteric modulators (C7/3-phth
[heptane-1,7-bis(dimethyl-3'-phthalimidopropylammonium)], dimethyl-W84
[hexane-1,6-bis(dimethyl-3'-(4-methyl-phthalimido)propylammonium)],
gallamine, strychnine, alcuronium, or hexamethonium), the new designed
modulator 4P-C7/3-phth
[heptane-1,7-bis(dipropyl-3'-phthalimidopropylammonium)], or hypothetical
modulators (4I-C7/3-phth
[heptane-1,7-bis(diisopropyl-3'-phthalimidopropylammonium)] or 4C-C7/3-phth
[heptane-1,7-bis(dicyclopentyl-3'-phthalimidopropylammonium)]), one modulator
molecule was placed at an arbitrary location in the aqueous region, more than
20 Å from the centre of the extracellular vestibule, and allowed to diffuse
freely until it associated spontaneously with the receptor, as we described
previously for β-adrenergic receptors. No artificial forces were applied to
the ligand. Initial conformations of strychnine and alcuronium were based on
crystal structures of strychnine and dimethyl caracurine-II (ref. 40),
respectively.

Simulation of tetraethylammonium. To probe the relative importance of centre
1 and centre 2 in ligand binding, we used tetraethylammonium (TEA), a simple
molecule containing a single quaternary ammonium group. We performed a 5.0-μs
simulation of one TEA molecule binding spontaneously to an unliganded M2
receptor, following the protocol described above (Supplementary Table 2,
condition AN). TEA bound and dissociated rapidly, allowing estimation of its
relative affinity for the two centres (Supplementary Fig. 13).

Design and simulation of mutated receptors. Before performing the radioligand
binding experiments described below (see section ‘Pharmacological assays’), we
performed simulations of C7/3-phth binding to several M2 receptor mutants we
had designed, on the basis of wild-type receptor simulations, to increase or
decrease allosteric modulator binding affinity. These simulations (conditions
N-AB; see Supplementary Table 2) were performed as described above. The
mutated residues were modelled using the PyMOL mutagenesis wizard. In
simulations containing N419W, N410Y, T84D and/or T423E mutations (conditions
N-P and Z), which were designed to increase modulator affinity, C7/3-phth
adopted the same pose as it did when binding to the wild-type receptor, and
made the predicted favourable interactions (cation-π or cation-anion) with
the mutated residues. In the simulation of the N410K + T423K mutant
(condition Q), designed to decrease modulator affinity, C7/3-phth never
associated with the receptor. In simulations of the Y80K and Y83K mutants
(conditions AA and AB), C7/3-phth associated with the receptor only
transiently, never reaching a stable bound conformation. Simulations of
several other mutants designed to increase modulator affinity yielded
inconclusive results, typically because C7/3-phth adopted a different binding
pose than observed in wild-type simulations (conditions R-V). We thus chose
to express and perform radioligand binding experiments on those mutants shown
in Fig. 2.

To probe ligand-receptor interactions at centre 2 further, we also performed 
simulations of C7/3-phth binding to the Y80A, L82A and Y83A mutants (condi- 
tions W-Y, respectively; cf. Supplementary Fig. 14). 

Design and simulation of C7/3-phth analogues. We sought to design a
C7/3-phth analogue that would maintain the same binding pose but require a
wider conformation of the extracellular vestibule. For each of several
candidate analogues, we performed de novo binding simulations to an NMS-bound
M2 receptor (Supplementary Table 2, conditions AH, AK, AL), to test whether
the candidate would bind in a pose analogous to that of C7/3-phth. (These were
sometimes supplemented by simulations of the NMS-occupied receptor with the
candidate initially modelled into the extracellular vestibule in the
C7/3-phth pose (Supplementary Table 2, conditions AF, AI).) We also performed
simulations of the NMS-free receptor with each candidate initially modelled
into the extracellular vestibule in the C7/3-phth pose (Supplementary Table 2,
conditions AG, AJ, AM), to test whether the candidate would maintain a wide
vestibule conformation. 4P-C7/3-phth satisfied all criteria (Fig. 3d and
Supplementary Fig. 19) and was thus selected for synthesis and experimental
evaluation. Other candidates failed our computational tests, in that they
either bound to the NMS-free receptor without widening the vestibule or failed
to bind in a pose similar to that of C7/3-phth.

Force field parameters. We used the CHARMM27 parameter set for protein
molecules and salt ions and the CHARMM TIP3P model for water; protein
parameters incorporated CMAP terms and modified Asp, Glu and Arg side-chain
partial charges. We used a modified CHARMM lipid force field. Ligand
parameters were based on results from the CHARMM ParamChem web server, version
0.9.6 β (ref. 45), and incorporated specific modifications to partial charges,
dihedral terms and angle terms, as shown in Supplementary Fig. 24. When
ParamChem reported large penalties (errors) for its estimated partial charges,
better estimates were determined by submitting appropriately chosen fragments
or analogues to ParamChem (Supplementary Fig. 24). When large penalties were
reported for a dihedral term, a new term was created by fitting to high-level
DF-LMP2/aug-cc-pVTZ ab initio quantum mechanical calculations from a locally
tuned version of MOLPRO version 2010.1 (H.-J. Werner et al., Cardiff
University, 2010). Force field parameters for all ligands are available upon
request.

Because commonly used molecular dynamics force fields are known to model
cation-π interactions poorly, we modified certain force field terms to better
represent cation-π interactions involving simulated ligands. We began by
characterizing the error that results from using standard force field parameters to
determine the aqueous benzene-tetramethylammonium (TMA) association constant.
We simulated one benzene molecule and two TMA molecules in 150 mM
NaCl for 300 ns. Benzene was parameterized using the CHARMM phenylalanine
parameters; TMA was parameterized using the ParamChem server. The benzene-TMA
association constant (K_a) was determined on the basis of the fraction of time
in which the benzene formed a complex with either TMA. Although a simple
model based on experimental data predicts a K_a of about 1.8 M⁻¹ (ref. 47), our
simulation yielded a value of only 0.67 M⁻¹, corresponding to a binding energy
discrepancy of approximately 0.6 kcal mol⁻¹ (standard force field parameters led to
weaker association than is observed experimentally). We strengthened benzene-TMA
association by increasing the ε parameter for the van der Waals (Lennard-Jones)
interactions between carbon atoms of benzene and carbon atoms of TMA from
0.0734 kcal mol⁻¹ to 0.175 kcal mol⁻¹ (the σ parameter, 3.8 Å, was not changed).
Simulations with these modified parameters yielded a more realistic K_a of
1.86 M⁻¹. We used this modified C-C ε parameter for van der Waals interactions
between those ligand carbon atoms directly bonded to quaternary ammonium
nitrogen atoms and the carbon atoms of all aromatic rings in the M2 receptor.
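
The binding energy discrepancy quoted above can be checked from the two association constants with ΔΔG = RT ln(K_a,reference / K_a,simulated). The following is a minimal sketch of that arithmetic, not the authors' analysis code; the function name is illustrative and the temperature of 310 K is an assumption, as no temperature is given in this passage.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def binding_energy_discrepancy(ka_reference, ka_simulated, temperature_k=310.0):
    """Return RT * ln(ka_reference / ka_simulated) in kcal/mol.

    A positive value means the simulation associates more weakly than
    the reference. The default temperature is an assumed value.
    """
    if ka_reference <= 0 or ka_simulated <= 0:
        raise ValueError("association constants must be positive")
    return R_KCAL * temperature_k * math.log(ka_reference / ka_simulated)


# Association constants (M^-1) quoted in the text
standard = binding_energy_discrepancy(1.8, 0.67)  # standard parameters
modified = binding_energy_discrepancy(1.8, 1.86)  # modified C-C epsilon
print(f"standard: {standard:.2f} kcal/mol, modified: {modified:.2f} kcal/mol")
```

Under this assumption the standard parameters give a discrepancy close to the 0.6 kcal mol⁻¹ stated in the text, and the modified parameters bring it close to zero.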

Because strychnine has a tertiary, not quaternary, ammonium group, we also
developed an alternative correction involving modification of the van der Waals
interaction between the ligand ammonium nitrogen atom and carbon atoms of
aromatic residues. In this case, we modified the ε parameter from 0.118 kcal mol⁻¹
to 0.385 kcal mol⁻¹ (the σ parameter, 3.4 Å, was not changed). Simulations with
these modified parameters yielded a benzene-TMA K_a of 1.83 M⁻¹. We used this
C-N ε parameter modification in simulations of strychnine. As a control, we also
simulated N-methylstrychnine (which has a quaternary ammonium group but has
properties similar to those of strychnine) with the C-C ε parameter modification
and verified that its behaviour in simulation was similar to that of the strychnine
simulations. Likewise, we confirmed that C7/3-phth and dimethyl-W84 simulations
using the C-C ε parameter modification (condition A, simulation 5;
condition B, simulation 5; condition J, simulation 3; condition K, simulation 3; see
Supplementary Table 2) yielded behaviour matching that of simulations using the
C-N ε parameter modification (all other simulations under conditions A, B, J and
K). We used the C-N ε parameter modification in conditions C, D, L-V and AE
(see Supplementary Table 2), and the C-C ε parameter modification in all other
simulations.

Analysis protocols. Trajectory frames (atomic coordinates) were saved every
180 ps during production simulations. VMD was used to visualize trajectories,
and molecular images were rendered using PYMOL and POV-Ray (Persistence of
Vision; http://www.povray.org). The ammonium group locations shown in Fig. 1e
were sampled at 1-ns intervals from 1.0-μs simulation windows. Cross sections in
Fig. 3c, d are on a plane through Y177 and N419 perpendicular to the lipid bilayer.

The root mean squared deviation (r.m.s.d.) of ligand atomic positions was 
calculated by first aligning the receptor transmembrane helices (Cα atoms of
residues 20-49, 57-86, 93-126, 138-166, 184-213, 383-412 and 419-439), and 
then computing the r.m.s.d. of non-hydrogen ligand atoms. For each ligand, the 
binding pose adopted in most simulations was declared the consensus pose. A 
single representative of this consensus pose was used to compute r.m.s.d.; this 
representative pose is shown in all figures, unless otherwise indicated. All r.m.s.d. 
traces were smoothed using a 10-ns running average. 
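
The r.m.s.d. and smoothing steps described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' pipeline: the coordinates are assumed to have already been superimposed on the receptor transmembrane Cα atoms (the alignment itself is not shown), the function names are hypothetical, and the running average is a centred window that shrinks at the ends of the trace.

```python
import math


def ligand_rmsd(coords, reference):
    """r.m.s.d. between two equally ordered lists of (x, y, z) positions
    of non-hydrogen ligand atoms, assumed to be pre-aligned on the receptor."""
    if len(coords) != len(reference) or not coords:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (x, y, z), (rx, ry, rz) in zip(coords, reference):
        total += (x - rx) ** 2 + (y - ry) ** 2 + (z - rz) ** 2
    return math.sqrt(total / len(coords))


def running_average(values, window):
    """Centred running average; the window is truncated at both ends."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed


FRAME_INTERVAL_NS = 0.18  # frames saved every 180 ps
WINDOW_FRAMES = round(10.0 / FRAME_INTERVAL_NS)  # frames spanning about 10 ns

# Toy example: a two-atom ligand displaced by 1 Å along z
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
frame = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(ligand_rmsd(frame, reference))
```

A trace of per-frame r.m.s.d. values would then be passed to `running_average(trace, WINDOW_FRAMES)`.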

An estimate of the energetic contribution of electrostatic repulsion to cooperativity— 
the difference between the allosteric modulator’s electrostatic binding energy in the 
presence and absence of NMS—was determined as the sum, over all modulator 
atoms, of the atomic partial charge multiplied by the receptor electrostatic differ- 
ence potential (NMS-bound minus unliganded) evaluated at the position of each 
modulator atom. Electrostatic potentials were determined by solving the linearized 
Poisson–Boltzmann equation at an ionic strength of 150 mM using APBS; only
protein and (when present) NMS atoms were included in this calculation. For each 
allosteric modulator, this calculation was performed for 10 frames evenly spaced 
over that portion of a trajectory during which the modulator was bound. The 
average of these sums is the electrostatic energetic contribution for that modulator 
in that particular trajectory, and each of the values in Supplementary Table 4 
represents a further average over several trajectories. For the PAMs (alcuronium 
and strychnine), we used trajectories with NMS bound in the orthosteric site; NMS 
was removed before APBS calculation of the ‘unliganded’ electrostatic potentials.
For the NAMs (C7/3-phth, dimethyl-W84 and gallamine), we used trajectories in
which the orthosteric site was unoccupied; before APBS calculation of the
‘NMS-bound’ electrostatic potentials, NMS was docked manually into the orthosteric
site, and receptor atomic positions were minimized to remove steric clashes.
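
The sum described above, atomic partial charge multiplied by the difference potential at each modulator atom and then averaged over frames, can be sketched in a few lines. This is a hypothetical illustration: it assumes the potentials have already been evaluated at the atom positions (for example from APBS output), and that charges are in units of e and potentials in kT/e, so the result is in kT. The function names are not from the source.

```python
def electrostatic_contribution(charges, phi_nms_bound, phi_unliganded):
    """Sum over modulator atoms of q_i * (phi_bound_i - phi_unliganded_i)
    for a single trajectory frame."""
    if not (len(charges) == len(phi_nms_bound) == len(phi_unliganded)):
        raise ValueError("all inputs must have one entry per modulator atom")
    return sum(
        q * (bound - free)
        for q, bound, free in zip(charges, phi_nms_bound, phi_unliganded)
    )


def trajectory_average(frames):
    """Average the per-frame sums; each frame is a tuple
    (charges, phi_nms_bound, phi_unliganded)."""
    if not frames:
        raise ValueError("at least one frame is required")
    values = [electrostatic_contribution(*frame) for frame in frames]
    return sum(values) / len(values)


# Toy example with two atoms and two frames (made-up numbers)
frame_1 = ([1.0, -0.5], [2.0, 1.0], [1.0, 3.0])
frame_2 = ([1.0, -0.5], [3.0, 1.0], [1.0, 1.0])
print(trajectory_average([frame_1, frame_2]))
```

In the procedure described above, 10 evenly spaced frames per trajectory would be passed to `trajectory_average`, followed by a further average over trajectories.
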
Docking protocol. For purposes of comparison, we performed flexible ligand
docking using Glide (Schrödinger), using the same human M2 receptor structure
as the modulator binding molecular dynamics simulations described above.
Modulators were allowed to dock in a 25-Å box centred on the centroid of residues
E172, E175, Y177, N419, W422 and T423, which were
selected on the basis of mutagenesis data suggesting the general location of the
allosteric binding site. This docking box contained, and indeed was
roughly centred on, the binding poses observed in molecular dynamics simulations.
The results achieved with a 25-Å box did not differ qualitatively from the
results of control docking experiments using a 15-Å box. Each ligand was docked
in Glide, using standard-precision docking with the default settings. Ten docked
poses for each ligand were obtained, and these docking predictions were compared
with the pose obtained by molecular dynamics (Supplementary Fig. 21). Control
docking experiments with extra-precision (XP) settings did not produce results
more similar to those obtained by molecular dynamics simulation. This same
procedure was also used to dock modulators to receptor coordinates taken from
a molecular dynamics simulation of the unliganded M2 receptor (condition AD;
see Supplementary Table 2).

Pharmacological assays. Reagents were obtained from the following vendors:
Dulbecco’s modified Eagle’s medium (DMEM), hygromycin-B and CHO-FlpIn
cells (Invitrogen); fetal bovine serum (FBS; Thermo Electron); [³H]NMS
(>82.0 Ci mmol⁻¹; PerkinElmer Life Sciences); oligonucleotides (Geneworks).
Mycoplasma testing was performed regularly on cell lines using the MycoAlert™
kit (Lonza); cell lines were mycoplasma-free before and after experiments were
conducted. All other reagents, purchased from Sigma, were analytical grade.
Mutagenesis and stable cell line generation. Site-directed mutagenesis of the
human M2 receptor was performed in the pENTR/D-TOPO vector before transfer
to the pEF5/FRT/V5-DEST vector, as previously described, to produce a receptor
with a V5 epitope tag at the carboxy (C) terminus. The V5-tagged wild-type
receptor has been shown to possess pharmacological properties equivalent to those
of the untagged receptor. Receptor constructs were integrated isogenically into
CHO-FlpIn cells, as previously described. Stably transfected cell lines were
maintained in DMEM supplemented with 10% FBS and 600 μg ml⁻¹ hygromycin-B
(‘complete DMEM’) at 37 °C in a humidified incubator containing 5% CO₂, 95% O₂.
A list of mutagenesis primers used can be found in Supplementary Table 8.
Radioligand binding assays. Radioligand binding experiments were performed
on CHO-FlpIn whole cells stably expressing the human M2 receptor construct of
choice. After plating 20,000 cells in complete DMEM into 96-well ISOPLATE TC
plates (all amounts are per well), cells were allowed to grow overnight at 37 °C.
The next day, cells were washed with phosphate-buffered saline (100 μl) and
resuspended in binding buffer (10 mM HEPES, 100 mM NaCl, 10 mM MgCl₂, pH
7.4). Assay mixtures, in a total volume of 200 μl with a 1/10 dilution of drug, were
incubated at room temperature (22 °C) for 4 h, except for the comparison between
C7/3-phth, 4P-C7/3-phth and alcuronium at the wild-type receptor, which was
performed over 24 h. Assays were terminated by buffer removal followed by rapid
washing, twice, with ice-cold 0.9% NaCl (100 μl). Plates were allowed to dry
inverted for 30 min; OptiPhase Supermix scintillation cocktail (100 μl) was added,
plates were sealed (TopSeal™) and radioactivity was measured in a MicroBeta2
LumiJET microplate counter. Saturation binding experiments were performed
in the absence or presence of atropine (100 μM) with 0.003-3 nM [³H]NMS.
Inhibition binding experiments were performed with 0.2 nM [³H]NMS (the
approximate K_A) in the presence of various concentrations of C7/3-phth,
gallamine, 4P-C7/3-phth or alcuronium. For [³H]NMS dissociation kinetics
experiments, cells were first incubated with the radioligand at room temperature (22 °C)
for 4 h before initiating radioligand dissociation, using atropine (100 μM) either in
the absence or presence of a high concentration (100 μM) of a modulator (either
C7/3-phth or gallamine).

The number of replicates in each group was chosen according to standard 
practices in the field. All experiments used n = 3, except for certain measurements 
of [³H]NMS dissociation rate and allosteric modulator affinity and cooperativity, which
used n = 4 or n = 5 (see Supplementary Fig. 11 and Supplementary Tables 6 and 7).

As a control, we repeated several experiments in a buffer that did not contain 
magnesium ions. In particular, we removed MgCl₂ from the buffer described
above and performed inhibition binding experiments for C7/3-phth at the wild-type,
N410Y, T84D, N410Y + T423E and N410K receptors. All C7/3-phth affinity
values increased by a factor of 2-4 in the absence of magnesium, but each mutation
still led to approximately the same change in C7/3-phth affinity (data not shown).
Data analysis. Nonlinear regression was performed using Prism 5.04 (GraphPad
Software). Total and non-specific [³H]NMS binding data were globally fitted to a
one-site saturation binding model to derive estimates of the radioligand
dissociation constant (K_d) and the maximal density of binding sites (B_max) for the
wild-type receptor and each mutant (Supplementary Fig. 10). Dissociation kinetics data
for all mutant receptors were fitted to a monoexponential function to derive
observed dissociation rate constants (k_off; Supplementary Fig. 11). With two
exceptions (noted below), all interaction binding data were fitted to a kinetic
allosteric ternary complex model (kinetic ATCM) to accommodate any potential
non-equilibrium binding under our experimental conditions. For this latter
analysis, the equations are as derived previously by Lazareno and Birdsall, with the
only modification being that the affinity constants in the original equations were
re-cast as equilibrium dissociation constants:


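$$B_t = B_{AB}\left[1 - e^{-k_{\mathrm{on,obs}}\,t}\right] \tag{1}$$

where

$$k_{\mathrm{on,obs}} = k_{\mathrm{off,obs}}\left(1 + \frac{[A]}{K_{\mathrm{app}}}\right) \tag{2}$$

$$K_{\mathrm{app}} = \frac{K_A\left(1 + [B]/K_i\right)}{1 + [B]/(K_i/\alpha)} \tag{3}$$

$$k_{\mathrm{off,obs}} = \frac{k_{\mathrm{off}} + [B]\,k_{\mathrm{off},B}/(K_i/\alpha)}{1 + [B]/(K_i/\alpha)} \tag{4}$$

and

$$B_{AB} = \frac{[A]/K_{\mathrm{app}}}{1 + [A]/K_{\mathrm{app}}} \tag{5}$$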


In these equations, [A] denotes the concentration and K_A the equilibrium
dissociation constant of the orthosteric radioligand; [B] denotes the concentration and K_i
the equilibrium dissociation constant for the allosteric modulator; α is the
cooperativity factor for allosteric interaction between radioligand and modulator; K_app
denotes the apparent equilibrium dissociation constant of the orthosteric
radioligand in the presence of modulator; B_t denotes the fractional binding of the
radioligand at time t; B_AB denotes the fractional binding of the radioligand in the presence
of modulator at equilibrium; k_on,obs and k_off,obs denote the apparent association and
dissociation rate constants for the radioligand in the presence of a modulator,
respectively; k_off denotes the radioligand dissociation rate constant when the receptor
is not occupied by modulator; and k_off,B denotes the radioligand dissociation rate
constant for the modulator-occupied receptor; this latter value was not
significantly different from zero, and was constrained as such. This analysis assumes that
the binding kinetics of each modulator are rapid relative to the radioligand and that
the modulator rapidly achieves equilibrium with the allosteric site, as is generally
found for prototypical modulators of the muscarinic acetylcholine receptors.
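
To make the kinetic ATCM concrete, the definitions above can be written as a single function. This is a minimal sketch under stated assumptions, not the Prism model used in the study: the argument names are illustrative, the term [B]/(K_i/α) is computed as α[B]/K_i so that α = 0 is handled without division by zero, and the dissociation rate constant for the modulator-occupied receptor defaults to zero, following the constraint described above.

```python
import math


def kinetic_atcm(t, conc_a, conc_b, k_a, k_i, alpha, k_off, k_off_b=0.0):
    """Fractional radioligand binding B_t at time t.

    conc_a, k_a: radioligand concentration and dissociation constant
    conc_b, k_i: modulator concentration and dissociation constant
    alpha: cooperativity factor (values below 1 are negative cooperativity)
    k_off, k_off_b: radioligand dissociation rate constants for the free
        and modulator-occupied receptor (same time units as t)
    Concentrations and dissociation constants must share the same units.
    """
    scaled_b = alpha * conc_b / k_i  # equals [B] / (K_i / alpha)
    k_app = k_a * (1.0 + conc_b / k_i) / (1.0 + scaled_b)
    k_off_obs = (k_off + k_off_b * scaled_b) / (1.0 + scaled_b)
    k_on_obs = k_off_obs * (1.0 + conc_a / k_app)
    b_ab = (conc_a / k_app) / (1.0 + conc_a / k_app)
    return b_ab * (1.0 - math.exp(-k_on_obs * t))


# Without modulator and with [A] equal to K_A, binding approaches one half
no_modulator = kinetic_atcm(1e6, 0.2, 0.0, 0.2, 10.0, 0.1, 0.1)
# A negatively cooperative modulator lowers the equilibrium binding
with_modulator = kinetic_atcm(1e6, 0.2, 100.0, 0.2, 10.0, 0.1, 0.1)
print(no_modulator, with_modulator)
```

The numerical values in the usage lines are arbitrary and serve only to show the limiting behaviour.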

For the Y80K and N410K + T423K mutants, the affinity of the modulators was
too weak to allow for a reliable fit of the full kinetic ATCM (the binding curves
could not be fully defined). Thus, to obtain an estimate of the affinity of each
modulator under these conditions, we fitted the model to each data set with the
cooperativity parameter α restrained to each of 11 values ranging from 1 to 0 in
0.1-unit increments. All fixed values of α yielded similar affinity estimates:
estimated pK_i ranged from 4.64 to 5.06 for C7/3-phth at Y80K, from 3.12 to 3.52 for
gallamine at Y80K, from 3.88 to 4.21 for C7/3-phth at N410K + T423K and from
3.32 to 3.64 for gallamine at N410K + T423K. Thus, although we cannot estimate
α with confidence for these mutants, we can derive reasonably accurate estimates
of affinity. In each case, the value of α yielding the minimum mean squared error
for each curve fit (0.5 and 0.7 for C7/3-phth at Y80K and N410K + T423K,
respectively; 0.4 and 0.3 for gallamine at Y80K and N410K + T423K, respectively)
was used to determine the estimate of modulator affinity shown in Fig. 2d. The
error bars for Y80K and N410K + T423K represent a sum of the inherent error of
the pK_i estimate with this fixed α value and the change in pK_i estimates that
resulted from setting α to the most extreme values.
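
The restrained-α procedure can be illustrated with a simple scan. The sketch below is deliberately simplified and is not the method used in the study: it assumes an equilibrium form of the ATCM instead of the kinetic model, replaces nonlinear regression with a grid search over pK_i, and uses synthetic data. All names and numerical values are hypothetical.

```python
def equilibrium_binding(conc_a, conc_b, k_a, k_i, alpha):
    """Fractional radioligand binding at equilibrium (simplified ATCM)."""
    k_app = k_a * (1.0 + conc_b / k_i) / (1.0 + alpha * conc_b / k_i)
    return (conc_a / k_app) / (1.0 + conc_a / k_app)


def fit_pki_fixed_alpha(data, conc_a, k_a, alpha, pki_grid):
    """Return (best pK_i, mean squared error) for one fixed alpha.

    data is a list of (modulator concentration, fractional binding) pairs.
    """
    best = None
    for pki in pki_grid:
        k_i = 10.0 ** (-pki)
        mse = sum(
            (equilibrium_binding(conc_a, b, k_a, k_i, alpha) - y) ** 2
            for b, y in data
        ) / len(data)
        if best is None or mse < best[1]:
            best = (pki, mse)
    return best


# Eleven fixed alpha values from 1 to 0 in 0.1-unit increments
ALPHAS = [(10 - i) / 10 for i in range(11)]


def scan_alpha(data, conc_a, k_a, pki_grid):
    """Fit pK_i separately for every fixed alpha value."""
    return {a: fit_pki_fixed_alpha(data, conc_a, k_a, a, pki_grid) for a in ALPHAS}


# Synthetic data generated with pK_i = 5 and alpha = 0.5 (concentrations in M)
CONC_A, K_A = 0.2e-9, 0.2e-9
modulator_concs = [1e-7, 1e-6, 1e-5, 1e-4, 1e-3]
data = [(b, equilibrium_binding(CONC_A, b, K_A, 1e-5, 0.5)) for b in modulator_concs]
pki_grid = [3.0 + 0.01 * i for i in range(301)]
results = scan_alpha(data, CONC_A, K_A, pki_grid)
best_alpha = min(results, key=lambda a: results[a][1])
print(best_alpha, results[best_alpha][0])
```

The spread of the fitted pK_i values across `results` plays the role of the ranges quoted above, and `best_alpha` corresponds to the α value with the minimum mean squared error.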

To determine statistical significance of changes in allosteric modulator affinity
(pK_i) values for mutant receptors, we used a standard one-way analysis of variance
(ANOVA) test with an unprotected Fisher’s least significant difference post-test;
this test was one-tailed in cases where we predicted an affinity change in a specific
direction (Fig. 2d) and two-tailed otherwise (Supplementary Fig. 14). We used the
same one-tailed test to determine statistical significance of changes in
cooperativity (α) for different modulators at the wild-type M2 receptor (Fig. 3d). To
determine the significance of changes in the affinity (pK_i) of those different modulators
to the wild-type M2 receptor and of changes in the % B_max and [³H]NMS pK_d
values for mutant receptors (cases where we had no previous prediction of a change
or of directionality of change), we used ANOVA with a two-tailed Dunnett’s
post-test (Supplementary Figs 10, 12 and 20). The statistical tests used make standard
assumptions of data normality and equality of variance. The data did not show
evidence of significant deviation from a normal distribution, and F-tests did not
reject the hypothesis of equality of variance for all groups.

Synthesis of the allosteric modulator 4P-C7/3-phth. The allosteric modulator
4P-C7/3-phth was prepared in three steps, as follows. (1) Pimelic acid was
activated with oxalyl chloride; the resulting acid chloride was then coupled with
N,N-dipropylamine to afford N,N,N′,N′-tetrapropyl heptanediamide. (2) The diamide
was reduced with borane/THF or LiAlH₄ to afford N,N,N′,N′-tetrapropyl
heptane-1,7-diamine. (3) The diamine was quaternized with
2-(3-bromopropyl)-isoindoline-1,3-dione (ethylene carbonate solvent, sealed tube,
90 °C, 60 h, under N₂) to give, after HPLC purification, the target compound
‘4P-C7/3-phth’, N,N′-bis-(3-(1,3-dioxoisoindolin-2-yl)propyl)-N,N,N′,N′-tetrapropyl
heptane-1,7-diaminium bis-trifluoroacetate tetrahydrate. Additional detail on
compound synthesis and characterization is provided in the Supplementary
Information (‘Materials and methods for organic synthesis’ and Supplementary Fig. 25).

CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature12795


Corrigendum: Stereoinversion of tertiary alcohols to tertiary-alkyl
isonitriles and amines
Sergey V. Pronin, Christopher A. Reiher & Ryan A. Shenvi 


Nature 501, 195-199 (2013); doi:10.1038/nature12472 


In Fig. 3 of this Letter and on pages SI-24 to SI-28 and SI-84 to SI-93 of 
the Supplementary Information, the Me and n-Pr labels should be 
reversed in compounds 17 to 21. Corresponding errors also affect the 
precursor alcohols 42, 44, 46, 47 and 48 on pages SI-24 to SI-28. The 
conclusions of the paper are not affected by these errors, which have 
been corrected in the online HTML and PDF versions of this Letter.

The great outdoors


The Rocky Mountain Biological Laboratory field station outside Crested Butte, Colorado, offers high-tech facilities for fieldwork. 


Field stations offer sophisticated facilities and opportunities for large-scale research. 


BY ROBERTA KWOK 


Noah Whiteman’s 2011 field season was tough. He and his team spent two
summer months in the Rocky Mountains studying whether bacterial infections
made plants more vulnerable to herbivores. 
They wanted to isolate bacteria from col- 
lected leaves to infect plants in the field, but 
the station at which they were working, the 
non-profit Rocky Mountain Biological Labo- 
ratory (RMBL) near Crested Butte, Colorado, 
did not have the equipment to support sterile 
laboratory work. 

So two to three times every week, the team 
drove an hour each way to Western State Col- 
orado University in Gunnison to autoclave 
nutrient media and pour it into Petri dishes 


ready for growing bacteria. They were grate- 
ful for the facilities, but the process “was really 
cumbersome’, says Whiteman, an ecologi- 
cal geneticist at the University of Arizona in 
Tucson. “We were exhausted.” 

But by Whiteman’s 2012 field season, the 
RMBL had built a new research centre — with 
Bunsen burners, microfiltered water and fume 
cupboards for chemical work — mainly with 
funding from the 2009 stimulus package from 
the US federal government. The facilities got 
even better in 2013, when funding from the US 
National Science Foundation (NSF) and private 
donors allowed the station to add an autoclave, 
a shaking incubator, a polymerase-chain-reaction machine and a −80 °C freezer. The
team could now store more plant and bacteria 
samples and process them much more quickly. 


Many field stations used to offer biolo- 
gists little more than access to the land, basic 
equipment such as microscopes and a place to
sleep. But over the past decade or so, stations 
around the world have begun adding more 
sophisticated features: molecular-biology 
equipment, Wi-Fi, Global Positioning System 
(GPS) devices and features ranging from tow- 
ers that allow researchers to monitor the forest 
canopy to facilities for conducting large-scale 
lake experiments. The upgrades, often funded 
by government grants, are driven partly by the 
falling cost of technology. Meanwhile, there is 
growing scientific interest in complex, large- 
scale research questions — including projects 
on the effects of climate change, invasive spe- 
cies and pathogens across entire regions. To 
support this work, programmes such as the NSF-funded US National Ecological
Observatory Network (NEON), based in Boulder,
Colorado, are collecting standardized ecologi- 
cal and atmospheric data across whole coun- 
tries — a far cry from the simple collection of 
flora and fauna that once characterized field- 
station research. “Back in the day, you were just 
grabbing creatures,” says Sarah Oktay, direc- 
tor of the Nantucket Field Station operated by 
the University of Massachusetts Boston. Now, 
she says, scientists are more interested in big 
questions relevant to entire regions. 


HIGH-TECH FACILITIES 

Upgraded Internet access at some stations 
allows researchers to upload data automati- 
cally from field sensors to university servers or 
data repositories. In 2006, with funding from 
the NSF, the Nantucket Field Station installed
Wi-Fi over about 80% of its 43-hectare site, 
giving scientists online access to sensors to 
track species of interest (one team installed 
cameras to record videos of fiddler crabs) or 
monitor temperature, light or soil moisture. 
That speeds up access to data and makes it 
easier to find out whether a sensor has broken 
down. Otherwise Oktay might “have to put on 
a pair of waders and drive about 20 miles” to 
discover a broken sensor, she says. 

Improved mapping technology allows 
scientists to collect precise location data. In 
2004, the RMBL began offering handheld 
GPS units to visiting researchers. At the time, 
the units were able to pinpoint locations to 
within 100 centimetres; 2 years ago, with 
access to more satellite data, they could reach 
20-40 centimetres. The station maintains a 
database of spatial information such as eleva- 
tion data and locations of roads and species, 
so that researchers can look for correlations 
between their observations and other factors, 
for example between animals’ adrenaline levels 
and the creatures’ distance from recreational 
trails. And scientists conducting long-term 
studies can be confident that they are returning
to the same spots year after year, instead of
relying on physical markers that might be 
moved or buried by wildlife. 

Some field stations have installed especially 
sophisticated experimental facilities. In 2008, 
with NSF funding, the La Selva Biological Sta- 
tion in northern Costa Rica, managed by the 
non-profit Organization for Tropical Studies 
in Durham, North Carolina, completed 3 tow- 
ers between 33 and 41 metres high that were 
equipped with electricity and Wi-Fi, allowing 
easy access to the forest canopy. The towers 
were constructed to make the station more 
attractive to researchers and educators, says 
station director Carlos de la Rosa. A robot 
shuttles between towers on cables and collects 
data on surface reflection, solar radiation and 
sound; weather stations gather meteorological 
data at various heights. Researchers can study 
differences between ecosystems on the ground 
and those high above the forest floor, such as 
variations in the types of beetle present. One 
team, says de la Rosa, is considering record- 
ing butterflies with cameras at different levels, 
then streaming the footage to the Internet. The 
researchers could then recruit members of the 
public to watch the videos and help to identify 
species, saving researchers many hours of view- 
ing time. 


WATER WORK 

Studies of aquatic ecosystems are also benefit- 
ing from better equipment. Last year, the Leib- 
niz Institute of Freshwater Ecology and Inland 
Fisheries in Neuglobsow, Germany, set up its 
LakeLab facility to improve studies on the 
effects of climate change. Funded mainly by a 
grant from the Federal Ministry of Education 
and Research, the system consists of 24 cylin- 
drical enclosures in Lake Stechlin, inside which 
scientists can simulate changes in the depth of 
the upper level of warm water, and measure the 
effects on flora and fauna. Researchers from 
Germany and Hungary are studying how these 
changes affect ciliates and algae, for example. 


INTO THE WILD 


Planning is the key to success 


The basics of planning a successful visit to 
a field station have not changed. Be sure
to book well in advance: some stations can
fill up a year ahead. Consider whether the 
weather will be conducive to the work. “I’ve 
had people come and look for clams, and 
I’m like, well, they’re going to be under a 
foot of ice,” says Sarah Oktay, director of 
the Nantucket Field Station operated by the 
University of Massachusetts Boston. 

Let the director know what equipment is 
needed and bring back-up electronics. If the 
station is not near civilization, don’t count on 
being able to buy extra batteries. And avoid
planning a packed schedule; researchers
often underestimate how long fieldwork will 
take, and have to leave without enough data. 
Once at the site, be open to opportunities 
and conversation, says Ian Billick, director of
the Rocky Mountain Biological Laboratory 
near Crested Butte, Colorado. Visiting 
another scientist’s field site can foster 
rich discussions about the complexities of 
the surrounding environment and lead to 
new research directions — an experience 
sometimes lacking in the laboratory, where 
the focus tends to be limited to a few 
processes of interest. R.K.

Jorg Sareyka (left) and Mark Gessner collect
zooplankton samples at LakeLab, Germany. 


The enclosures are large, about 20 metres deep and 9 metres wide, which reduces
the influence of the walls and allows experiments to
better mimic natural conditions, says Mark
Gessner, director of LakeLab.

The advantages offered by modern field sta- 
tions go beyond technology: researchers stand 
to gain from many years of collective wisdom. 
Ecological studies that began in the 1960s and 
1970s have laid the groundwork for a new gen- 
eration of researchers, says Ian Billick, direc- 
tor of the RMBL. Today, field stations provide 
accumulated intellectual capital: decades of 
original data sets, archived research plans, spec- 
imen collections and oral history that can prove 
invaluable to young investigators. A researcher 
who wants to know the best place to find a par- 
ticular flower species can easily get tips from a 
field-station staff member or a colleague who 
has worked at the site, instead of sifting through 
published papers. “That background informa- 
tion is critical for allowing new scientists to get 
projects up and running quickly,” says Billick 
(see ‘Planning is the key to success’). 


GOING BIG 

Field-station directors often coordinate with 
each other and share data, allowing scientists 
to conduct large-scale studies. Over the past 
10-20 years, the non-profit Organization of 
Biological Field Stations (OBFS) in Woodside, 
California, has worked for increased commu- 
nication, says Oktay, who is secretary of the 
organization. Directors are also keen to sup- 
port individual researchers. For example, if a 
scientist approaches the OBFS with an interest 
in studying a moth parasite in New England, 
the organization will e-mail all field-station
directors in the region to find out who has
collected that parasite or moth, helping the 
researcher to decide where to visit. 

Many stations are putting their data 
online, making it easier for scientists to 
spot trends. Black Rock Forest, a field sta- 
tion in Cornwall, New York, already has 
weather and other environmental sensor 
data posted on a project website and plans 
to add hydrology and tree-growth data, as 
well as all of its research publications, by 
next year, says Bill Schuster, the station’s 
executive director. 

Federally funded programmes are gath- 
ering data on larger scales, and making 
them accessible to all researchers. NEON 
aims to collect ecological observations 
at 106 sites across the United States for 
30 years. It has installed basic infrastructure 
such as sensor towers and instrument huts 
at 24 sites, and hopes to complete all sites 
by 2017, at a cost of $434 million. Sensors 
will monitor features such as atmospheric 
conditions and soil properties at terrestrial 
sites, and pH, oxygen and nutrient levels 
in streams; field crews will study organ- 
isms and fly instrumented planes to take 
high-resolution aerial images and survey 
vegetation. In Australia, the Terrestrial Eco- 
system Research Network (TERN) based in 
St Lucia performs a similar function, gath- 
ering data on greenhouse-gas exchange, soil 
characteristics, bioacoustics and weather at 
ten sites across the country. 

NEON and TERN each collect their data 
in a standardized way so that researchers 
can make comparisons on regional or conti- 
nental scales. Without that consistency, “you 
literally cannot say what is happening across 
the country”, says Suzanne Long, the executive
knowledge broker at TERN, who works
to increase the network’s impact on environ- 
mental policy and practice in Australia. 

The data could complement work at spe- 
cific field sites, suggests Dave Tazik, director 
of biology at NEON; the observatory’s sites 
could serve as control plots while research- 
ers experimentally manipulate other plots. 
“We think of it as providing a backbone of 
data,” he says. Scientists can also apply to 
add their own sensors to NEON towers. 
Researchers could develop a hypothesis 
on the basis of their field-station work — 
perhaps investigating how plants respond 
to climate change — and then analyse the 
freely available NEON data to explore 
whether trends hold true on a broad scale. 

Now more than ever, field stations are 
helping scientists to tackle big questions 
about pressing environmental issues, such 
as the effect of climate change on ecosystems. 
“As these systems fall apart, it’s actually a very 
good time to study them,” says Billick.


Roberta Kwok is a freelance science writer 
in Seattle, Washington. 


COLUMN 


Fast-tracked talent 


Expediting visa approval helps countries to attract the 
best researchers, says Conor O’Carroll.


Encouraging and facilitating mobility among scientists helps both the
researcher and the country thirsty for
talent. Indeed, the success of the €70.2-billion 
(US$95-billion) Horizon 2020 European 
Union (EU) research programme, set to be 
launched on 11 December, hinges on bringing 
researchers to Europe from around the globe. 
And yet obtaining a visa, the key to being 
mobile, can be a major challenge. Ireland’s 
experience with smoothing the visa process 
shows one way to make the system more effi- 
cient while maintaining border security. 

The EU has 28 member states, and negotiat- 
ing their immigration procedures can be dif- 
ficult. Eliminating differences has been a core 
part of European Research Area policy, and in 
2005 Europe-wide legislation was introduced 
to ensure fast-track immigration for international
researchers through the ‘scientific visa’.
All EU countries except Ireland, the United 
Kingdom and Denmark were obliged to imple- 
ment this fast-track visa, but the system has 
been put in place with varying levels of enthu- 
siasm and effectiveness. Ireland recognized the 
potentially positive impact of the scientific visa 
and voluntarily introduced it in 2007 to attract 
more researchers. 

Ireland's largely successful system can serve 
as a model for other nations. Although Ire- 
land aspires to be a research hub, its economy 
remains fragile, and science funding is tight. 
Ireland cannot afford to stymie the movement 
of talent with cumbersome visa protocols. 

We use a secure online verification system 
that links immigration services with embas- 
sies around the world, and we have a simple 
application procedure to reduce bureaucracy. 
The online system means that after applicants 
obtain research jobs in Ireland, they can get 
visas for themselves and their families within 
days, rather than the previous six to eight 
weeks for the scientists and up to a year for 
their families. Scientists used to have to pay 
€1,000 (US$1,300) per year for a work per- 
mit; it is now free. Ireland implements the 
visa through its EURAXESS office — one of 
200 offices across Europe providing advice 
and support for mobile scientists — based in 
Dublin, at the Irish Universities Association (of 
which I am the research director). 

Over the past 6 years, 1,750 researchers from 
80 countries have come to Ireland using the 
fast-track scientific visa. Half of them came
from China, the United States and India. How
do we know that the changes have made a real 
difference? In March, EURAXESS Ireland car- 
ried out a survey of more than 300 researchers 
with scientific visas, the first such survey by any 
European country participating in the scheme. 
Of those surveyed, 84% were still working in 
Ireland; 27% had found employment in the 
areas of information and communications tech- 
nology, and 26% in computer and life sciences. 

Fifty-three per cent of the respondents said 
that the fast-track visa was very important when 
deciding to proceed with a job in Ireland. The 
survey also revealed that 23% of researchers 
would definitely not have come to Ireland if the 
fast-track visa scheme had not been in place. 

Policymakers and those keen on building a 
nation’s scientific reputation often assume that 
facilities and staff are the crucial factors for 
drawing in researchers. However, our survey 
shows clearly that the immigration process is 
key to decision-making. The story is likely to 
be the same for other countries that are work- 
ing to increase their scientific excellence by 
attracting researchers. 

Competing for international scientific tal- 
ent is a challenge. Nations such as Ireland that 
have invested heavily in science only in the past 
decade have to compete against international 
leaders including the United States and the 
United Kingdom. Providing an easily accessible,
fast-track visa can make a real difference.


Conor O’Carroll is research director at the 
Irish Universities Association in Dublin and 
chair of the European Steering Group on 
Human Resources and Mobility. 


FUTURES SCIENCE FICTION


BOTTLED UP 


BY ANATOLY BELILOVSKY 


“You won't like it,” said the corkscrew.
“A most disappointing product, not what I
expect from a Grand Cru winery at all.
Hardly any nose to speak of, and
the tannins are totally unbalanced.
You really should pull me out of
this cork, lay the bottle down for
a few more years, and make yourself
a nice cup of tea.”

He'd made it for her once and
she'd pretended he'd got it right
but he knew he hadn't. The tea she 
made herself was so much better, its 
aroma suffused the whole house. She 
would sit in that enormous old chair in 
her comfy old sweater two sizes too big, 
feet tucked under her, nose in the teacup, 
eyes half shut behind steamed-over glasses, 
her smile would light up the room more than 
the fireplace ever did, and he would just sit 
and look at her for hours on end. 

He might be looking at her still if she
weren't dead.

“Just pop it,” he said through his teeth, 
eyes blurring, hands beginning to shake. 

“Are you going to chug the wine?” asked 
the corkscrew. 

He looked around, opened a cabinet, 
selected a glass. 

“That’s a white-wine glass,” said the cork- 
screw. 

“So?” 

“The volatile scents won't be properly 
concentrated,” said the corkscrew. “It will 
interfere with the proper enjoyment of the 
wine.”

“What are you, my mother?” he said. 

“No, I am not your mother. I am Smorkscrew,
a beverage-management solution
equipped with artificial intelligence. Now, 
what food were you considering to accom- 
pany this wine?” 

He looked around again, pointed at a dish. 
“This?” he said. 

“Peanuts?” said the corkscrew. “That 
would be an insult even to this inferior 
excuse for a Valpolicella. Haven't you got any 
cheese? Fresh mozzarella would be perfect. 
Or turkey prosciutto and melon. Be sure to
slice the prosciutto very thin —”

“She’s dead!” he
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Hard to swallow. 


your —” he waved his hands “— whatever 
you use instead of head!” 

“I know she’s dead,” said the corkscrew. 
“It was my honour to open many a bottle for 
you and your guests as you sat shiva. Many 
people came.”

“Everyone loved her,” he said.

“She wasn't there,” said the corkscrew. 
“She was dead, remember?” 

“What’s your point?” he said. “Get it? 
Point?” He tried to laugh, but only a sob 
came out. 

“They came for you,” said the corkscrew. 
“Many people. They all love you.” 

“And now? Where are they now?” 

“A phone call away.”

“Screw you,” he said. “Get it? Screw you?”
He turned away, covered his face with his 
hands. “I want to be alone,” he whispered. 
“Just open the damn bottle.”

“She wouldn't like it,” said the corkscrew. 

“She is not here to not like it,” he bit off. “She
is not in this room. She is not in this house.
She isn't anywhere on this Earth, anywhere
under the Sun, anywhere except...” He
clenched his fists. “Open the bottle, please.
Just ... I'll get a proper glass, I'll make some
sandwiches, I'll sit in her chair and cuddle her
sweater, it won't be like I’m drinking alone, I'll
drink to her and I'll drink to me, just open it.”
“She's not in that bottle, either,” said the
corkscrew. 

“What makes you such a damn expert on 
her?” he screamed. 

“I am an expert system, you know,” said
the corkscrew. “Able to access the totality of
information about the two of you throughout
the World Wide Web, and to correlate
its implications upon the problem at
hand.”

“Would my medical history be part
of that totality?” he asked quietly.

“I must regretfully decline to
divulge trade secrets of Smorkscrew,
LLC,” said the corkscrew.
“However, consider this: I am —
was — her present to you. The
process of gift selection gave
me an opportunity to perform a
thorough assessment of her character
and motivations.”

“What you mean is, she loved 
me, and she didn't want me to drink 
myself to death,” he said. “In case...” 
He trailed off. An old clock ticked off 
the seconds in the silence. The cherry tree 
shook in the wind beyond the window, shed- 
ding the last of its dry brown leaves (it had 
been white with bloom the day they packed to 
go on their second honeymoon, the sun had 
shone so bright, the shadows so crisp —). 

“She knew then, didn't she?” he asked. 

“Yes,” said the corkscrew. 

“She found out about the cancer, and 
then we went on the anniversary cruise, the 
best vacation ever, and she bought you for 
me as a present, and I thought it was cute 
and thanked her, but I had got her a pair of 
diamond earrings so it kind of irked me for 
a while that all she got me was a stupid corkscrew
and all I said was, ‘Cute,’ and then a
week after we got back she told me she was 
going to die, and then she died and I never 
said I was sorry...” 

“Apology accepted,” said the corkscrew.

It wound itself from the cork and rolled 
towards its drawer, pausing at the coun- 
ter’s edge as he shook with sobs too long 
in coming.


Anatoly Belilovsky was born in what is
now Ukraine, learned English from Star
Trek reruns, worked his way through a US 
college by teaching Russian while majoring in 
chemistry, and has, for the past 25 years, been 
a paediatrician in New York, in a practice 
where English is the fourth most commonly 
spoken language. He blogs at http://loldoc.net. 